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MERRILL SWAIN 


and 


HENRI C. BARIK 


Ontario Institute for Studies in Education 


The Role of Curricular Approach, Rural-Urban 
Background, and Socioeconomic Status 

in Second Language Learning: 

The Cornwall Area Study* 


This paper presents the results of the evaluation of a kindergarten level 
bilingual education program operating in Cornwall (Ontario) and surrounding 
regions, and of a follow-up 40-minute-a-day French option program (evaluated 
in Grade 2). 

The kindergarten evaluation focussed on three dimensions: (1) curricular 
approach (alternate half days of English and French instruction vs. alternate 
full days of English and French instruction), (2) SES level (low SES background 
us. middle-upper background), and (3) locale (urban, semi-urban, rural). The 
Grade 2 evaluation examines the effect of having taken 40 minutes a day of 
French instruction in Grades 1 and 2 on English language skills and 
mathematical skills. It also examines performance in French comprehension of 
the Grade 2 students who had the bilingual kindergarten experience followed by 
two years of the 40-minute French option relative to the performance of students 
in other bilingual French programs and in other core French programs. (Dr. 
Swain and Dr. Barik are with the Modern Language Centre, Department of 
Curriculum, O.I.S.E.) 


A significant educational innovation in Canada in recent years has been 
the establishment of bilingual education programs for English-speaking 
students. These programs involve the use of French as the language of 
instruction for all (“total immersion”) or part (‘partial immersion’’) of the 
curriculum during one or more years of schooling. 

To a large extent, the impetus for these programs has come from the recent 
emphasis on bilingualism in many aspects of Canadian society. One of the 
pioneering programs was that initiated in 1965in St. Lambert, on the outskirts 
of Montreal, which has been carefully monitored over the years (see, e.g., 
Lambert & Tucker, 1972; Bruck, Lambert & Tucker, 1974). In that program, 
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English-speaking pupils receive their schooling completely in French 
throughout Kindergarten and Grade 1. Beginning in Grade 2, English 
Language Arts are taught for approximately one hour per day. With each 
successive grade, the proportion of time taught in English is increased so that 
by the end of elementary schooling approximately half the curriculum is 
taught in English and half in French. In the wake of the success of that 
program, a number of similar programs have been started and evaluated in 
almost every major region in Canada (see, e.g., Barik & Swain, 1976a; Swain & 
Barik, 1976; Edwards & Casserly, 1976; Gray & Cameron, 1976; Genesee & 
Chaplin, 1975; Shapson & Kaufman, 1976) and alternative programs have also 
been initiated to introduce the immersion formula at upper grade levels (e.g., 
Barik, Swain & Gaudino, 1976; Genesee, Polich & Stanley, 1977). Summaries of 
research findings and issues relating to immersion programs in Canada are to 
be found in Swain (1974), Swain and Bruck (1976), and Harley (1976). 
Immersion programs have also been started in the United States (Cohen, 
1974). 


To date the vast majority of French immersion or related programs in 
Canada have been initiated in urban centres and have in general drawn 
students from the middle to upper-middle class strata of society, although 
immersion programs have also been started with equal success in urban 
working class districts (Tucker, Lambert & d’Anglejan, 1973). A program 
initiated in Cornwall (Ontario) and surrounding areas departs from these 
other programs in that it involves both urban and rural centres and draws a 
substantial segment of its population from lower and lower-middle class areas. 
It also includes two approaches or formulae to bilingual education, the 
“alternate half days” and “alternate full days” approach, as described 
subsequently. The present paper discusses the findings of the evaluation of the 
Cornwall area program in 1975-76. The evaluation was carried out by the 
Bilingual Education Project of the Ontario Institute for Studies in Education, 
which is involved with a number of bilingual education programs throughout 
Ontario. 


Background Information and Description of Program 


Cornwall is an industrial city of approximately 46,000 inhabitants which is 
located in eastern Ontario, near the Quebec border. Approximately 60% of its 
inhabitants are native speakers of English and 40% native speakers of French. 
Due to the composition of its population and its closeness to the bilingual 
centres of Ottawa and Montreal, bilingualism in Cornwall is an important 
issue. 


In September 1973 the Stormont, Dundas and Glengarry (SDG) County 
Board of Education, which has jurisdiction over the English school system in 
Cornwall and surrounding regions, initiated a bilingual education program at 
the kindergarten (K) level in eight schools (and 10 classes), with half the 
curriculum being taught in French and half in English. Two formulae were 
followed, alternate half days and alternate full days. In the alternate half days 
approach, instruction is in one language for half the day (A.M. or P.M.) and in 
the other language for the other half, every school day of the week. The 
assignment of the two languages to morning or afternoon session varied 
among schools. In the alternate full days approach, instruction is in one 
language for the duration of one day and in the other language for the duration 
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of the following day, with the pattern repeated subsequently. In terms of 
amount of contact time with the two languages, the two approaches are thus 
equivalent. The alternate half days (HD) approach was employed in schools 
within the city limits of Comwall, while the alternate full days (FD) approach 
was followed in rural areas and in other schools outside the city limits. (In 
subsequent years, the FD approach was also adopted by some city schools.) 


In terms of amount of contact time with French (half a day per day), both 
HD and FD programs are thus equivalent to the total immersion half-day K 
programs followed in several other localities (see, e.g., Barik & Swain, 1975, 
1976a; Lambert & Tucker, 1972), and the HD approach is similar to one 
initiated in Ottawa by the Ottawa Roman Catholic Separate School Board first 
at the K level (Edwards & Casserly, 1976) and since then extended to further 
grades (Edwards, Doutriaux & McCarrey, 1976). 


Each language component of the program is taught by a different teacher 
with native or native-like fluency in the language in question. The amount of 
overlap in content materials is kept to a minimum, but by the nature of early 
childhood education, some overlap does occur, for example, in teaching 
mathematics concepts or in routines within the class or special themes (e.g., 
Christmas). However, each component has its own curriculum with its series of 
themes. 


The bilingual education program of the SDG Board of Education is at 
present (1975-76) restricted to the K level. In September 1974, the children 
graduating from either the HD or FD program entered an English Grade 1 
program which allowed for 40 minutes per day of instruction in French as a 
second language. Likewise, in September 1975, they advanced to a similar 
Grade 2 program. In Grade 1 and Grade 2, thereis no strict separation of pupils 
who followed the bilingual education program in K from those who did not 
have any French prior to Grade 1.1 It may be anticipated, therefore, that the 
French experience acquired by pupils coming from the K bilingual education 
program will have been somewhat “diluted” by the end of Grade 1 or Grade 2. 


At the K level, however, the bilingual education program has continued to 
exist, and in fact in September 1975, it was extended to the total K population 
served by the SDG Board of Education within the city limits of Cornwall, as 
well as to a number of other schools outside the city limits. 


At the inception of its K program in 1973, the SDG Board entered into an 
agreement with the Bilingual Education Project of the Ontario Institute for 
studies in Education to evaluate the program. The basic concerns at the time 
were (1) whether pupils in the program (following either HD or FD approach) 
were as ready by the end of the school year to enter Grade 1 as pupils who were 
enrolled in the regular K program, which consisted of half-day schooling in 
English only, and (2) the determination of their level of French proficiency 
relative to pupils enrolled in other types of programs (e.g., total immersion, or 
regular program with daily instruction in French). 


Due to a number of factors, an extensive evaluation of the program could 
not be carried out in the first two years of the program. The partial analysis 
carried out in 1973-74 revealed that there was no overall reliable difference 
between pupils in the bilingual education program and equivalent groups of 
pupils attending a regular K program relative to their readiness for beginning 
school work in Grade 1. In terms of level of proficiency in French listening 
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comprehension when placed in relation to pupils in other localities, pupils in 
the SDG bilingual education program scored approximately the same as pupils 
in a similar (half French, half English) program in Grade 1 who had had no 
French in K and had therefore had the same amount of contact time with 
French as the SDG group, as well as pupils in a regular Grade 1 program who 
had had 20-40 minutes of French per day throughout K and Grade 1 and had 
thus had less contact time with French on a cumulative basis than the SDG 
group (27.1 out of 62 for the SDG pupils, 28.2 for the Grade 1 bilingual program 
pupils, 29.4 for the Grade 1 regular program pupils), but the SDG pupils scored 
substantially lower than pupils in a K total French immersion (half-day) 
program (36.5 out of 62; non-SDG scores reported in Barik & Swain, 1976b). 

Similar results were obtained on the basis of an evalutaion in spring of 
1975, in which the same test of French listening comprehension was 
administered to samples of pupils in K and Grade 1. The Grade 1 pupils were 
from among those who had attended the K bilingual education program the 
previous year (Cohort I) and who were receiving 40 minutes of French per day 
in Grade 1, while the K pupils represented the second stream of children 
entering the bilingual program (Cohort II). The latter obtained a score of 29.5, 
similar to that of the preceding K group, while the Grade 1 pupils scored 
somewhat higher (34.2).? 


1975-76 Evaluation 


Sample. In Spring 1976, it was decided to evaluate the new K group 
(Cohort III) and also look at the initial cohort (Cohort I), which was then in 
Grade 2. Since, as mentioned, the bilingual education program was in 
operation throughout most of the K classes served by the SDG Board, the 
focus of attention in K shifted from a consideration of the bilingual education 
program relative to the regular program to one of considering three 
dimensions within the bilingual education program: curricular approach, 
locale, and socioeconomic level. 


1. Curricular approach. Alternate half days (HD) vs. alternate full days 
(FD). A group of K pupils following the HD approach was compared with a 
group of K pupils following the FD approach. Both groups were from urban 
area schools (Cornwall) of similar socioeconomic level (low SES). The HD 
group involved one class (16 pupils) and the FD group, two classes within a 
school (34 pupils). 

2. Locale. This study differs from that of many other innovative programs 
of French as a second language in Canada (such as total immersion), where 
students come primarily from middle and upper-middle class families located 
in urban centres, in that it extends the research to semi-urban and rural 
areas. In this context, it relates partly to a study carried out in Great Britain 
by Burstall, Jamieson, Cohen and Hargreaves (1974) which considered the 
school achievement of pupils who were undertaking studies in French as a 
foreign language in urban as well as rural centres. Burstall et al. found that 
rural area children performed as well and sometimes better in academic or 
specific subject areas. (Their pupils were older than the SDG group, being 8 or 
11 years old at the start of the investigation.) 


In the present study, three locales were defined: (a) urban (within the city 
limits of Cornwall), (b) semi-urban (small size town outside city limits), and (c) 
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rural (pupils coming from a predominantly agricultural environment). To 
consider this dimension, classes were selected from each area specified (two 
classes within the same school for the urban group, one class in the case of 
each of the other groups), with curricular dimension held constant (FD in all 
instances) and socioeconomic level varying between low-middle and middle 
class. (It was not possible to hold this factor constant across the three areas.) 
The numbers of pupils involved, once a few pupils who did not belong to the 
category specified were excluded from the analysis (12 pupils altogether), 
were: urban, 26; semi-urban, 14; rural, 16. The semi-urban and rural groups 
went to the same school, the semi-urban group consisting of pupils who lived 
in the town and were not bused to school, and the rural group of pupils from 
the surrounding region who were bused to school. (The two groups constituted 
separate classes within the school.) 


3. Socioeconomic level (SES). As stated previously, most innovative 
French programs have been developed in the context of middle to upper- 
middle SES environments. The question has been asked whether they may be 
equally effective in lower SES settings where there may not be the same level 
of parental support for or involvement in the program. With respect to total 
French immersion, that issue has been considered by Tucker, Lambert, and 
d’Anglejan (1973) and Bruck, Jakimik, and Tucker (1971), who found that 
relative to peers in the regular English program from the same SES milieu, 
immersion pupils in a low SES environment benefited as much from the 
immersion program as did pupils from more favoured backgrounds. These 
two studies, however, did not directly compare different SES groups of French 
immersion pupils, as is the case in the present study. It should be noted that 
in the Grade 1 bilingual education program evaluated by Edwards, 
Doutriaux, and McCarrey (1976), comparisons were also made between pupils 
from low SES and middle-upper SES backgrounds, and some significant 
differences were observed between the two; however, the overall picture led 
the investigators to conclude that there was no evidence of detrimental effects 
of the bilingual program for low SES pupils. 

Further research on SES level as it relates to French immersion has also 
been carried out by Cziko (1975) and is discussed by Genesee (1976). 


In the present study two levels were specified: low SES and middle to 
middle-upper (MU) SES.° One school was selected to represent each level, 
with both curricular approach (HD) and regional area (urban) held constant. 
One class (16 pupils) was involved in the low SES school, and two classes (27 
pupils) in the MU-SES school. (The low SES class in this design was the same 
as the HD class in the consideration of curricular approach.) 


The SES levels of the schools were specified by the SDG Board authorities. 
To determine the accuracy of these labels, information was obtained on the 
father’s (as well as mother’s) occupation for all pupils in the sample. This 
material generally confirmed the validity of the SES assignments, even if 
within each class a few pupils came from a home SES background 
unrepresentative of the class as a whole. To quantify the SES concept, the 
occupation of the head of the household (father or, in some instances, mother) 
in the case of each child was assigned a value on the basis of the 
classification scheme of occupations developed in a Canadian context by 
Blishen (1958). Occupations on this scale are grouped into seven classes from 
a “high” of 1 (e.g., doctors, judges, architects) to a “low” of 7 (e.g., janitors, 
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labourers, cooks), based on years of schooling and income. The average rating 
of the low SES group was 5.5 while that of the MU-SES group was 2.8, thus 
indicating a clear distinction between the two groups on the SES dimension. 
(The FD low SES group involved in the curricular approach comparison, item 
1 above, had an SES rating of 4.6.)4 


Kindergarten 
Test Battery 
The following tests were administered in spring of 1976: 


1. Otis-Lennon Mental Ability Test (Primary 1 Level, Form J). This 
general intelligence test measures the pupil’s facility in reasoning and in 
dealing abstractly with verbal, symbolic, and figural materials covering a 
broad range of cognitive abilities. Taking age into consideration, raw scores 
are converted into standardized Deviation IQ (DIQ) scores, which provide an 
index of mental ability. 


2. Metropolitan Readiness Tests (Form A). The Metropolitan Readiness 
Test has six different sections designed to measure the child’s readiness for 
school work in Grade 1: (i) word meaning, a picture vocabulary test; (ii) 
listening, a test of phrase and sentence comprehension; (iii) matching, a test 
of visual perception involving the recognition of similarities; (iv) alphabet, a 
test of ability to recognize lower-case letters of the alphabet; (v) numbers, a 
test of number knowledge; and (vi) copying, a test of visual perception and 
motor control. 


3. French Comprehension Test (Primer). This is a test developed by the 
Bilingual Education Project to measure the child’s comprehension of French, 
and is intended primarily for use in total French immersion K programs 
(which compare with the SDG program in terms of amount of contact time 
with French). The test consists of three parts. The first, words and sentences, 
requires the child to identify the picture referred to by a spoken French word 
or sentence. The second, questions, measures the child’s ability to choose the 
correct pictorial answer to a question in French. The third, stories, requires 
the child to choose the correct pictures in answer to questions concerning 
short stories which have been read aloud in French. 


The testing was carried out by consultants of the SDG County Board of 
Education. 


Design of Analysis 

In the comparisons that follow, differences between the groups considered 
were analyzed through one-way analysis of variance as well as through 
analysis of covariance using age and IQ as covariates. The data of three 
pupils who either had only covariate scores or who lacked such scores were 
excluded from the analysis. 


Results 


A. Curricular Approach. The data for the analysis relating to curricular 
approach (HD vs. FD) is shown in Table 1. 


1. Age and IQ. As seen, there are no reliable differences between the two 
groups relative to age or (Deviation) IQ. On the latter measure, the HD group, 
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TABLE 1 


KINDERGARTEN, CURRICULAR APPROACH: HALF DAYS (HD) vs. 
FULL DAY (FD) PROGRAM; UNADJUSTED AND ADJUSTED MEANS 


Unadjusted Means Adjusted Means (covariates=Age + DIQ) 
HD Group FD Group F ratio df HD Group FD Group fF ratio df 
(N=15) (N=34) 
Age (months, end year) ROS27 70.24 0.00 1/47 
Otis-Lennon Mental 
Ability Test--DIQ 110.67 114.50 Oa 1/47 
Metro. Readiness Test 
Sted (102. 4bens) 65.87 62.32 One 1/47 68.12 61.38 9.08%* 1/45 
a. Word meaning (16) 8.33 shyly 0.08 1/47 8.61 8.00 0.90 1/45 
b. Listening (16) IOP E nS) 10. 36 0.10 1/47 10.34 10.26 0.02 1/45 
c. Matching (14) LORSS els Deahs IG 10/59 8.86 8.33%** 1/45 
d. Alphabet (16) EUR 7/5} U2 82 0.26 1/47 2 ee 2 al ae 0.02 1/45 
e. Numbers (26) 14.80 IVS yete}5) 0.67 1/47 15.42 15.58 0.03 1/45 
f. Copying (14) 10.33 6.65 22.08*%** 1/47 10.61 6.53 37.81 *** 1/45 
French Comprehension Test 
PeinEetcier als (make 45) 26.00 28.34 NPS) 1/45 26.92 Paleo al 0.32 1/43 
a. Part 1-Words/ 
Sentences (25) 15.00 16.91 2.00 1/45 15.66 16.60 On72 lyase 
b. Part 2-Questions (10) 6.60 Tine: 189 1/45 6.72 Tees LG Oe 1/43 
c. Part 3-Stories (10) 4.40 4.22 0.09 1/45 4.54 Gel5 0.47 1/43 


#* pl <=. Oil 
RKK pp < .001 


with an average IQ score of 110.7, scores in the vicinity of the 75th percentile 
relative to (American) norms for the test and the FD group, with a score of 
114.5, scores slightly above the 80th percentile. 


2. School Readiness. When unadjusted scores are considered, no difference 
is noted between the two groups on overall (total) measure of school readiness 
as measured by the Metropolitan Readiness Tests (MRT), and when the 
various sections of the test are investigated, the only reliable difference that 
emerges is with respect to the section on copying, on which the HD group 
scores significantly higher than the FD group (p < .001). There is no imme- 
diate explanation for this finding. (It is interesting to note that a similar 
trend (p = .10) is noted with respect to matching, which likewise involves 
perceptual processes.) It seems unlikely that this finding can be attributed 
to the effect of the dimension of the curricular approach that has been 
examined (HD vs. FD). 

When scores are adjusted for age and IQ, however (even though the two 
groups do not differ significantly on these two factors), the difference between 
the two groups on overall readiness score, in favour of the HD group, becomes 
statistically significant (p < .01). As seen from the subtest results, this is a 
function of the two sections specified previously, matching (p < .01) and 
copying (p < .001). However, until further evidence such as that derived from 
replication studies and observational data is found to link the results on these 
two sections to the curricular dimension investigated here, caution must be 
employed in interpreting the significant overall difference in school readiness 
between HD and FD groups. 

In terms of (American) norms for the test (on the basis of unadjusted 
scores), HD pupils score at the 73rd percentile and FD pupils at the 66th 


percentile. 
3. French Comprehension. Table 1 reveals that there is no reliable 
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difference between HD and FD pupils in level of French comprehension; thus 
the two curricular approaches, which involve the same amount of contact 
time with French, yield equivalent results in measure of French listening 
comprehension. 


There are as yet no norms for the French Comprehension Test, Primer, on 
the basis of which to evaluate the scores of 26.0 and 28.3 out of 45 obtained by 
the HD and FD groups, respectively. It may be worth noting, however, that 
these scores are in close agreement with an “approximative” score of 25.4 
determined for the test on the basis of prior data from children in a French 
immersion K who had an equivalent amount of contact time with French as 
the SDG pupils, and “actual” scores of 24.3 recorded in French immersion K 
programs in two other localities in Ontario and of 25.6 for pupils following a 
program similar to the SDG one in several schools in the province of Quebec.*® 
These results thus suggest that pupils in the SDG bilingual education 
program (following either HD or FD approach) attain the same level of 
proficiency in French as pupils in a French immersion K who are exposed to 
the same amount of contact time with French.® 

(If the French Comprehension Test data from Tables 1, 2, and 3 are 
combined, counting the HD data in Table 1, which is repeated as the low SES 
data in Table 3, only once, the average score for all Cornwall/SDG pupils 
considered is 27.30.” Thus, overall, the SDG pupils tested score as well as total 
immersion pupils for whom data are presently available.) 

In summary, there appear, therefore, to be few differences in the results 
obtained on the basis of either HD or FD curricular approach. 


B. Locale. The data for the analysis by locale (urban, semi-urban, rural) are 
shown in Table 2. 

1. Age and IQ. No reliable difference is found among the three groups with 
respect to age. On IQ, however, there is a reliable difference among the three 


TABLE 2 


KINDERGARTEN, GEOGRAPHICAL REGION: URBAN (U), SEMI-URBAN (SU), 
RURAL (R); UNADJUSTED AND ADJUSTED MEANS 


Unadjusted Means Adjusted Means (covariates=Age + DIQ) 


U SU F tatlo df U SU R F ratio df 


R 
(N=25) (N=14) (N=15) 


Age (months, end year) (@esal72 69.29 69.87 (oj stil 2y 50 


1. Otis-Lennon Mental 


* 
mi eles ae Bath 118.96 110.21 109.53 3.68* 2/51 


2. Metro. Readiness Test 


Total (102 items) 66.52 63.86 60.87 0.97 2/51 62.61 67.21 64.25 1.42 2/49 
a. Word meaning (16) 10.08 9.29 9.27 0.96 2/51 9.58 9.72 O70 OR 03 2/49 
b. Listening (16) 11.08 OF 9.73 3.46% 2/51 10.65 S596 10.09 0.83 2/49 
c. Matching (14) 9.40 9.43 9.60 0.02 2 Hail tos Hal 10.03 KOVR NS) Ne 2/49 
d. Alphabet (16) 14.16 WR /al W253 1.93 2/51 13.5 LS 257 12.70 0.32 2/49 
e. Numbers (26) UBieoz 13.350 13.07 0.07 Zils 12.41 14.23 13.89 290) 2/49 
f. Copying (14) 8.48 91536 7.07 een 2/51 Dod 3: 10.00 7.68 3.35* 2.49 
3. French Comprehension 
fas Gyeeiaeds (23) 22.48 19.85 22.64 is Gili 2/47 PRS 20.83 2361 Zrels) 2/45 
a. Pt.1-Wds/Sent. (25) 13.09 PSS 14.64 1.42 2/47 12.54 13.62 Ayu le 3.83% 2/45 
b. Pt.2-Questions (10) 3)c)3}f/ 3.592 tora 3.12% 2/47 Diets) 4.28 Bigs / 2.14 2/45 
c. Pt.3-Stories (10) 3.83 Salih 2079 3.34% 2/47 3.64 2.93 Paes 1.42 2/45 
SP pw .05 
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groups (p < .05). A further analysis indicates that the urban group from 
Cornwall (IQ = 119.0) differs from both other groups (p < .05), while there is 
no difference between semi-urban (110.2) and rural (109.5) groups. 


In terms of norms for the mental ability test, the urban pupils score at the 
88th percentile and the semi-urban and rural pupils at the 73rd percentile. 


2. School Readiness. Table 2 indicates that there are no reliable differences 
among the three groups on the total score of the MRT on the basis of either 
unadjusted or adjusted scores. Thus even though urban area pupils have 
higher IQ’s, pupils from the other two regions — rural and semi-urban — are 
still as ready for beginning school work in Grade 1. 

In their study involving urban and rural comparisons, Burstall et al. 
(1974) attributed their findings showing equivalent or better academic 
performance on the part of rural area pupils relative to urban area pupils, to 
differences in the school environments of the two groups. The typical 
environment for rural area pupils was that of the small one-room school, with 
close contacts and close teacher involvement, whereas urban area pupils 
frequented larger more impersonal establishments. In the present study, 
however, these differences are not as apparent; the semi-urban and rural area 
schools are not of the one-room type and resemble in many respects the urban 
area schools. The study thus generalizes Burstall et al.’s (1974) findings 
relative to the urban-rural dimension to more uniform environments than 
those characterized by their study. 


On the MRT as a whole, urban pupils score at the 74th percentile, semi- 
urban pupils at the 69th percentile, and rural pupils at the 63rd percentile. 


When subtests are considered, there is a reliable difference among the 
three groups (p < .05) in listening on the basis of unadjusted scores, with the 
urban group scoring higher (p < .05) than each of the other two groups. When 
scores are adjusted for age and IQ, the difference in listening disappears, but 
one emerges on the copying section (p < .05), with the urban group scoring 
higher than the other two groups. 

3. French Comprehension. As can be seen from Table 2, there is no reliable 
difference among the three groups in level of French comprehension on the 
basis of either unadjusted or adjusted total scores, although there are some 
differences on some of the subsections. The unadjusted scores of 22.5, 19.9 and 
22.6 for the urban, semi-urban and rural groups, respectively, are somewhat 
lower than those recorded for other urban groups in the overall evaluation 
(see Tables 1 and 3), and also somewhat lower than those specified earlier for 
pupils in other localities (“approximative” as well as “actual” scores). 

Generally, therefore, the data suggest that the bilingual education 
program offered by the SDG Board is equally as applicable in semi-urban and 
rural areas as in urban areas. 


C. Socioeconomic Level. The data for the analysis by socioeconomic level or 
status (SES) — low SES vs. MU-SES — is shown in Table 3. As stated 
previously, the class which represents the low SES group is the same as that 
which represented the HD curricular dimension in Table 1, hence the 
identicality of the (unadjusted) data for that group in Tables 1 and 3. 

1. Age and IQ. There is no reliable difference between the two groups in 
age, but there is one with respect to IQ, favouring the MU-SES group 
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TABLE 3 


KINDERGARTEN, SOCIOECONOMIC LEVEL: LOW SES vs. MIDDLE TO 
MIDDLE-UPPER (MU) SES; UNADJUSTED AND ADJUSTED MEANS 


Unadjusted Means Adjusted means (covariates = Age + DIQ) 
Low SES MU-SES FF ratio df Low SES MU-SES’ F ratio df 
(N=15) (N=27) 
Age (months, end year) 70.27 69.19 1.04 1/40 
1. Otis-Lennon Mental 
* 

Ability Test--D1Q 110.67 120.70 4.95 1/40 

2. Metro. Readiness Test 

x 

Total (102 items) 65.87 I3Be33 5.00 1/40 69.17 7L.50 0.87 1/38 
a. Word meaning (16) fleas is} 10.70 9.99%* 1/40 8.68 L045" 6.27% 1/38 
b. Listening (16) 10.13 11.70 6.12* 1/40 10.48 10251 PAA TAY) 1/38 
c. Matching (14) LO S38 10.48 0.00 1/40 11 S017 10.18 1.60 1/38 
d. Alphabet (16) 1s 73 14.85 13.48%*** 1/40 IN C222 14.59 7 .83%* 1/38 
e. Numbers (26) 14.80 16.26 1.35 1/40 P85 15.67 0.03 1/38 
f. Copying (14) 20/533 9533) Le94 1/40 10.86 9.04 7 .46%* 1/38 

3. French Comprehension ns erg 
PesteTorad.. (45) 26.00 372i 41.29%**k 1/39 27.233 36.50 36.75 1/37, 
a. Pt.1-Wds/Sent. (25) 15.00 20.77 24.57%*k 1/39 15591 20.24 19.60*** 7 37 
b. Pt.2-Questions (10) 6.00 8.46 20.07*** 1/39 6.71 8.40 14.63*** = 1/37 
ec. Pt.3-Stories (10) 4.40 8.04 41.42%** 1/39 Ava 7.86 32.31 *** WS y 


x p -05 


< 
ee OS 2 OL 
kK 1 < 2001 


(p< .05). In terms of norms, the low SES group scores at the 74th percen- 
tile and the MU-SES group at the 90th percentile. 


2. School Readiness. On the basis of unadjusted scores, the MU-SES group 
scores significantly higher than the low SES group on the total MRT test 
(p < .05) as well as on three subtests (word meaning, listening and alpha- 
bet). However, when scores are adjusted for age and IQ, the overall test 
difference disappears, while the subtests that reveal significant differences 
are word meaning, alphabet and copying (in favour of the low SES group 
in the latter instance). 

In terms of norms for the test (unadjusted scores), low SES pupils score at 
the 73rd precentile and MU-SES pupils at the 86th percentile. 


3. French Comprehension. The MU-SES group scores significantly higher 
(p < .001) than the low SES group on the French Comprehension Test, on the 
basis of unadjusted as well as adjusted scores. This applies also to all 
subsections of the test. The score of the low SES group (26.0 out of 45), 
however, appears adequate and, as previously stated, in line with data from 
total immersion programs, so that the significant differences between the two 
groups is attributable primarily to the high level of performance of the MU- 
SES group (37.3), which is atypical relative to other scores obtained in the 
study (Tables 1-2). Their level of performance may thus be representative of a 
school/teacher effect. 


Grade 2 


As stated earlier, in Grade 2 the French option consists of daily 40-minute 
periods. Pupils of Cohort I who were enrolled in the bilingual program in K (in 
1973-74) do not constitute separate classes but are to be found with pupils who 
began the French-as-a-second-language program (40 minutes/day) in Grade 1 
Olas 
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Sample 


In the 1975-76 evaluation, only a few Grade 2 classes were tested. The 
experimental group consisted of pupils who had attended the K bilingual 
program and had taken French (40 minutes/day) in Grades 1 and 2. The 
comparison group consisted of pupils from other comparable classes who had 
not taken any French throughout K-2. The experimental classes selected were 
those which retained a large proportion of pupils who had followed the K 
bilingual program. The two groups were represented in both urban and rural 
areas, with pupils from low and middle SES backgrounds. The experimental 
group consisted of 4 classes involving 76 pupils and the comparison group of 3 
classes and 73 pupils. 


In the analysis, a number of pupils were excluded, as follows: (a) pupils 
who did not satisfy the group definition of either French or no French 
instruction throughout Grades K-2; (b) pupils whose age fell outside a 
specified age range (74-110 months), which is + 18 months from the “normal” 
age of 92 months for Grade 2 pupils in May (assuming that they begin in 
Grade 2 at 7 years or 84 months). This criterion was used to exclude atypical 
cases; (c) pupils unable to understand or express themselves in English, as 
indicated by a linguistic code assigned by the teacher; (d) pupils for whom no 
IQ data (employed as a covariate) was available. 


In the case of each group, 14 pupils were excluded on the basis of the above 
criteria; thus the analysis was based on 62 experimental and 59 comparison 
pupils.® 


Test Battery 
The following three tests were administered to Grade 2 pupils: 


1. Canadian Cognitive Abilities Test (Primary 2). This test of mental 
ability does not require any reading and consists of four parts: (i) oral 
vocabulary, which measures the child’s ability to label or name objects or 
actions or to identify objects when given their use; (ii) relational concepts, 
which measures the ability to identify size, position and quantity; (iil) 
multimental (“one that doesn’t belong’), which measures the ability to see 
relationships and to categorize or classify objects; (iv) quantitative concepts, 
which measures the ability to deal with quantitative relationships and 
concepts. The raw score on the test is converted to a (deviation) IQ measure. 


2. Canadian Tests of Basic Skills (Primary Battery, Level 8). This is a 
battery of tests which provides for comprehensive measurement of growth in 
several areas. There are five sections to the test: (i) vocabulary, which 
measures the reading and knowing of words (on the basis of pictorial or 
incomplete sentence stimuli); (ii) word analysis, which involves sound-letter 
association, phonetic analysis and word structure; (iii) reading, which 
involves sentences and stories; (iv) spelling, where the child has to select the 
misspelled word from three choices; and (v) mathematics, which contains two 
parts, one based on concepts and the other on problem solving. In the scoring, 
the two mathematics sections are also combined to yield a total mathematics 
score, and all five sections of the test (vocabulary, word analysis, reading, 
spelling, and total mathematics) are in turn combined to yield a total test 
composite score. 

3. French Comprehension Test (Level 1). This is a test of French listening 
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TABLE 4 
GRADE 2, UNADJUSTED AND ADJUSTED MEANS 


Unadjusted Means Adjusted Means (covariates = Age + IQ) 
Exp. Gr. Comp. Gr. F ratio df EXpj, Gri. Comp. Gr. F ratio df 
(N=62) (N=59) 

Age (months, end year) 95.91 96.29 0.28 Ta) 

Canadian Cognitive : 

Abdlities Teate-DLO 104.18 100. 34 7 1222 

Canadian Test Basic Skills 

Grade Equiv. Sc. 
1. Vocabulary Dn HSS 2.79 0.01 1/120 2eo 2.85 OF9L 1/118 
2. Word analysis 2.84 PAS TRS) 0.09 Wa BST) 2.82) 0.20 1/120 
3. Reading 2.16 PAs) ilyitn 1/119 Deve) 3.00 5.28% 1/117 
4. Spelling 2.49 2.84 4.56% 1/119 2.44 2.89 8.34%* 7) 
5. Math concepts 225 PLETE 1.88 ay 2a 25S LROS 10.21** 1/119 
6. Math problems 2.60 2.87 4.57% 1/120 2555 PR Ye 14.78*** 1/118 
7. Math Total 2.60 2.86 4.38% 1/119 2.55 2.91 18. 59%** ny Ay 
8. CTBS Composite 2.67 2.86 1.89 1/116 2.60 292 8.17%* 1/114 


French Comprehension Test, 
Level 1, Total (max.=45) 
Pt.1-Wds/Sent. (22) 4 
Pt.2-Questions (9) Des SHS} --- 
Pt.3-Stories (14) 4 


* p< 7.105 
kk p< .01 
RK p< 001 


comprehension similar in format to the one employed in K. The Level 1 test is 
intended primarily for pupils completing Grade 1 of a total French immersion 
program. 


Results 
The Grade 2 results are found in Table 4. 


1. Age and IQ. As seen from Table 4, there is no reliable difference between 
experimental and comparison groups with respect to either age or IQ. In 
terms of norms for the Canadian Cognitive Abilities Test, the experimental 
pupils score at the 60th percentile and the comparison pupils at about the 
50th.?9 


2. English and Mathematics Skills. Table 4 presents the results on the 
Canadian Tests of Basic Skills in terms of grade equivalents. Since pupils 
were tested in May, their grade placement at the time of testing was 2.8. As 
can be seen from the data (unadjusted scores), the comparison group scores at 
approximately that level on all sections of the test (English and mathematics 
sections as well as composite or total test score), while the experimental group 
obtains slightly lower grade equivalent scores on a few sections of the test. 
Statistically, the comparison group scores significantly higher (p < .05) than 
the experimental group on spelling, mathematics problems and mathematics 
total on the basis of unadjusted scores. When scores are adjusted for age and 
IQ, these differences are maintained (at p < .01 or p <.001), and in addition 
the comparison group scores significantly higher on reading (p < .05), 
mathematics concepts (p < .01) and composite test score (p < .01). 


It would thus appear that the experimental group is generally somewhat 
behind the comparison group in English and mathematics skills by the end of 
Grade 2. It may be conjectured that this is attributable to the time devoted to 
French (40 minutes/day) in the experimental group’s program in Grades 1 
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and 2, which reduces by the same amount the time devoted to instruction in 
English. Whether the bilingual education experience of the experimental 
pupils in K is also a contributing factor is difficult to determine; in terms of 
amount of schooling in English in K, the experimental pupils had the same 
amount (the equivalent of half a day per day) as the comparison pupils, and 
as stated previously, obtained similar results with respect to school readiness 
(on the basis of more extensive samples of pupils), so that the effect of the K 
experience on the differential Grade 2 results is unlikely. Aside from the time 
factor as a possible explanation for the Grade 2 results, it may be that there 
are other aspects to the situation: the introduction of the French component 
for short daily periods may lead to temporary disturbances in the classroom 
which have some effect on the teaching of other program material or on the 
motivational/attitudinal climate of the classroom. The situation needs to be 
monitored more carefully. 


It is interesting to note that the present study is one of the few that make 
comparisons between a French-as-a-second-language program (40 minutes/- 
day) and no-French as this factor relates to performance in other subject 
areas (English, mathematics). The majority of studies have typically 
considered a “massive” French program (e.g., total or partial immersion) in 
relation to either a French-as-a-second-language program (e.g., Swain & 
Banik, 1976) or no-French (e.g., Barik & Swain, 1976a), but the issue of the 
relationship between a regular French-as-a-second-language program (40 
minutes/day) and a no-French program has seldom been considered. 


3. French Comprehension. On the French Comprehension Test, Level 1, 
the experimental pupils obtain a score of 12.3 out of 45. There are no other 
data with which this score can be compared directly, as there is no other 
group following exactly the same program. However, on the basis of data 
found in Barik, 1976b (p. 16), several indirect comparisons are possible. One is 
with a Grade 2 bilingual program where, following English K, pupils receive 
half of their instruction in English and half in French throughout Grades 1 
and 2. These pupils obtain a score of 12.5 on the test. Another comparison is 
with a Grade 1 intensive French program which involves a half-day English, 
half-day French program in K (thus similar to the Cornwall situation), 
followed by 90 minutes of French per day in Grade 1, which thus yields 
approximately the same amount of contact time with French as that 
encountered by SDG pupils in the course of Grades 1 and 2 (a cumulative 
amount of 80 minutes per day). Pupils in the intensive French Grade 1 
program obtain a score of 13.0. A third comparison is with pupils in a Grade 2 
core French program who receive 30 minutes of French per day in K (in 
contrast to the SDG pupils) and 40 minutes per day in Grades 1 and 2 (similar 
to the SDG group). Pupils in the core French program obtain a score of 10.6, 
which is somewhat lower than that of the Cornwall Grade 2 group. The score 
of the SDG pupils would thus appear to be adequate when placed in relation to 
these other groups. 


Conclusions 


The findings reviewed above with respect to the K bilingual program and 
the Grade 2 program in schools under the jurisdiction of the SDG County 
Board of Education are summarized below. 
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In the Kindergarten program: 


1. With respect to the curricular approach: (a) There are no reliable 
differences between pupils following the HD approach and those following 
the FD approach with respect to age or IQ; (b) With respect to school 
readiness, there is no overall difference between the two groups on the basis of 
unadjusted scores, but there is an overall difference favouring the HD group 
when scores are adjusted for age and IQ. This difference seems to be a 
function of two subtests, matching and copying, which involve perceptual 
and motor processes; (c) HD and FD pupils exhibit similar levels of French 
comprehension, which are commensurate with those obtained by pupils in 
total French immersion K programs who have the same amount of exposure 
to French. 


2. With respect to locale: (a) There is no reliable difference among urban, 
semi-urban, and rural area pupils with respect to age, but there is one with 
respect to IQ, with urban pupils scoring higher than the other two groups; (b) 
No reliable difference is found among the three groups in overall measure of 
school readiness on the basis of unadjusted as well as adjusted scores. A 
difference favouring the urban group is found on the listening section on the 
basis of unadjusted scores, while one favouring the semi-urban group is found 
on the copying secton when scores are adjusted for age and IQ. In general, 
then, pupils from all three areas appear to be equally ready for beginning 
school work in Grade 1; (c) There is no reliable difference in overall level of 
French comprehension among the three groups. 


3. With respect to socio-economic status (SES): (a) There is no reliable 
difference between low SES and MU-SES pupils attending the bilingual 
program with respect to age, but there is one favouring the higher SES group 
with respect to IQ; (b) MU-SES pupils score higher in school readiness than 
low SES pupils when unadjusted scores are considered, but the overall 
difference disappears when adjustment is made for IQ (and age) differences; 
however, reliable differences in favour of the MU-SES group remain with 
respect to word meaning and alphabet, while the low SES group scores 
significantly higher on copying; (c) MU-SES pupils score significantly higher 
than low SES pupils on a measure of French listening comprehension, but 
this is due to their atypically high level of performance relative to other 
groups tested, indicative possibly of a school/teacher effect. The performance 
level of the low SES group appears adequate in relation to other comparative 
data. 


In the Grade 2 program, (a) There is no reliable difference between the 
experimental pupils following the French option (40 minutes per day in 
Grades 1 and 2, following bilingual K) and the comparison pupils not taking 
French, with respect to either age or IQ; (b) Pupils in the experimental group 
score significantly lower than pupils in the comparison group with respect to 
most aspects of English and mathematics skills when scores are adjusted for 
age and IQ. This may be a function of the reduction in the amount of 
instruction in English which they receive; (c) On a measure of French 
comprehension, the experimental pupils score similarly to pupils in other 
programs with whom some indirect comparisons can be made. 


The results from this study thus suggest that a French immersion K can be 
effectively undertaken in rural as well as urban areas, and with low as well as 
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middle-upper SES groups. Considering the results related to English 
achievement at the Grade 2 level, however, leads one to question the efficacy 
of following the immersion experience with a 40-minute a day French option. 
In other programs which have continued the immersion experience through 
later grades, French language performance is considerably better and, 
additionally, no long-run detrimental effects on English language skills have 
been noted. 


* Thanks are expressed to the administrators of the Stormont, Dundas Glengarry County Board of Education, to 
the principals and staff of the schools participating in the study, for their cooperation. Special thanks are 
addressed to Denis Boissonneault and Donalda Smith. We also wish to express our gratitude to Edna A. 
Nwanunobi for her assistance in the analysis of the data, and to Sharon Lapkin, who acted as liaison with the 
Board authorities. 


The research reported in this paper was funded by a Grant-in-Aid of Educational Research from the Ministry 
of Education for the Province of Ontario. 


Notes 
1 Although the Board envisaged separate programs in Grade 1 and subsequent grades 
for pupils graduating from the K bilingual education program, this has apparently 
not been administratively feasible. 
2 The Cornwall results discussed here (1973-74 and 1974-75) are based on unpublished 
data. 


3 The Cornwall area generally lacks a high SES population, so that it was not possible 
to have a more marked distinction, such as between low SES and high SES. 

4 Although in the present Canadian context some occupations no longer quite 
correspond to their assignment in the Blishen (1958) classification scheme, e.g., 
certain types of engineering professions may no longer rate 1 in terms of socio- 
economic dimension, the scale remains a useful instrument. 

5 The ‘“‘approximative” score was obtained on the basis of an earlier version of the test 
by considering only the items that appear in the published version (Barik, 1976a). 
The ‘“‘actual” scores cited were provided by school boards which made use of the test 
in evaluations of their French programs. The data from these school boards and from 
others will be used to derive norms for the test. 

6 The present results thus contrast with those recorded in 1974, which showed the SDG 
pupils scoring lower in French comprehension than total immersion pupils, on the 
basis of a different version of the test. 

7 This figure is based on 123 pupils rather than on the N of 130 indicated by the group 
N’s at the start of each table, since 7 pupils lacked a score on the test. 

8 Criteria similar to (b) and (c) above were also in effect for the K data, but did not lead 
to the exclusion of any pupils. In addition, in the analysis of French comprehension 
test data, the scores of pupils coming from a home environment where French was 
used as a means of communication were eliminated (on the basis of score criteria). 
This affected one pupil in Grade 2, but none in K. 

9 The substantial differences in the percentile ranks associated with the Grade 2 data 
relative to those for K discussed in relation to Tables 1-3 likely reflect the fact that the 
norms were developed with different populations — American norms for the Otis- 
Lennon test in K and Canadian norms for the Canadian Cognitive Abilities Test in 
Grade 2. 
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Effect of Practice on Form Discrimination 
Learning: A Test of the Distinctive Features 
and Schemata Hypotheses 


Thirty subjects (15 males and 15 females) were presented with a visual form 
discrimination task which required subjects to match Roman letters when a 
variety of transformations were held constant. Subjects were given massed 
practice across four blocks of eight stimuli each. The purpose was to compare 
the effects of practice under both the distinctive features and the schemata 
hypotheses. Results of a repeated measures analysis of variance showed 
support for the distinctive features hypothesis. This result was expected since 
familiar stimuli, used with transformations which are held constant, tend to 
produce no improvement along a practice dimension. (Dr. Tyler is with 
Teledyne Brown Engineering in Rockville, Md. and Dr. Hardy is with the 
Institute for Child Study at the University of Maryland.) 


Current perceptual learning literature brings to light two hypotheses 
(distinctive features or specificity hypothesis and schemata hypothesis) on 
whether children’s perceptual judgments improve with practice. The first 
hypothesis states that distinctive features are dimensions of object and 
pattern differences. Improvement of discrimination among objects or 
patterns consists of learning the specific distinctive features. The function of 
practice, then, enables one to respond to an increasing number of features and 
to discover which features are “critical” for defining the objects or patterns. 


The second hypothesis suggests that prototypes or models of objects or 
patterns have been built up through repeated experiences and “stored” in 
memory. Improvement in discrimination would first involve building a new 
schemata or model for making discriminations and then matching sensory 
stimuli to the existing schemata. The function of practice allows for the 
construction and refinement of appropriate schemata (Pick, 1965). Thus, the 
operation starts with an individual recording data, such as various sizes and 
arrangements of the letter ‘““A,”:and organizing them in a schemata. This 
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schemata is then used to direct subsequent perceptual activities involving the 
letter “‘A.”’ 


Frederickson and Guerin (1973), using psychophysical techniques, 
compared reduction of the Mueller-Lyer illusion among three groups of 
subjects — children aged 8 to 12, adults aged 18 to 25, and adults aged 60 to 70. 
Using a variant (red horizontal line) and an invariant (black horizontal line) 
as comparisons and providing no feedback, they found that practice with red 
horizontal lines produces greater reduction in illusion than does practice with 
black horizontal lines. Additionally, the sample of younger adults 
demonstrated greater reduction in illusion under both conditions than either 
the children or the older adult samples. These results were interpreted as 
consistent with the distinctive features hypothesis since the stimuli presented 
became more specific with time and practice. 


Pick (1965) used cognitive psychological techniques with three groups of 
kindergarten children to compare the effects of practice under both the 
distinctive features and the schemata hypotheses. In the first group, a control 
condition, the children were trained on one set of stimuli and transformations 
and tested on a transfer task which used another set of stimuli and 
transformations. In the second group, schemata condition, the transfer task 
consisted of the same stimuli but different transformations, while in the third 
group, distinctive features condition, the transfer task consisted of new 
stimuli but the same transformations. This latter group had the fewest errors. 
Thus, Pick concluded that learning distinctive features of form may be a 
significant component for improvement in the discrimination ability of 
young children. 


These two hypotheses may not be mutually exclusive. A basic tenet of both 
of these hypotheses is that the ability to discriminate the unfamiliar improves 
with practice, both with or without knowledge of results. Thus, the formation 
and subsequent refinement of a schema may necessitate the individual to 
notice particular distinctive features in a stimulus so that he is able to change 
the initial schemata that is being used to guide encoding while he is doing a 
task. 


If, however, the stimuli are already familiar to the subjects and the 
transformations are the same, one would expect, according to the distinctive 
features hypothesis, that no improvement would occur with repetition; 
however, according to the schemata hypothesis, one would expect repetition 
to improve discriminations as one practises transformations using familiar 
stimuli. It is the specific purpose of this study to investigate the validity of 
these two hypotheses using familiar stimuli, letters of the alphabet, with a 
variety of transformations held constant over four massed practice con- 
ditions. 


Method 
Subjects 


Subjects were 15 males and 15 females ranging in age from 3.10 years to 
4.11 years who were randomly selected from three preschools in the Menlo 
Park, California, area. The mean age was 4.3 years. 
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Materials 


The materials consisted of a portable box and 32 stimuli cards. The 
portable box had five plexiglass windows. These windows faced the subject. 
One subject response button was located beneath each of windows 2.04: and 
5. Window 1 was on the far left and always displayed the target on the 
stimulus card. This target form was to be matched as “same” with one of the 
four other forms presented simultaneously in the other windows. The serial 
positioning of the windows encouraged subjects to search the stimuli from left 
to night. 

During each subject’s trials, the experimenter held a small console unit 
with four indicator lights. When the subject pushed a correct response button, 
a red light flashed on the experimenter’s console unit. This arrangement 
allowed the experimenter to sit to the side and behind the subject making it 
possible to keep the subject’s attention directed towards the task while still 
being able to accurately determine his response. 

Each of the 32 stimuli cards was a 20 inch x 6 inch (50.80 cm. x 15.24 cm.) 
masonite board bearing five 4 inch x 4.5 inch (10.16 cm. x 11.43 cm.) black and 
white, high gloss, photographed stimulus forms. Each of the five forms 
appeared through the appropriate window of the portable box. The forms 
consisted of lower-case Roman letters(abcefghijkmnoqrstuvwyz) 
and were chosen from a confusion matrix (Gibson, Osser, Schiff, & Smith, 
1963). 

The letters were rotated and arranged into 16 simple and 16 complex 
stimuli cards. A simple card presented the same letter in each window. This 
letter was rotated in either four different horizontal-vertical positions or four 
different oblique positions. A complex card presented two letters in each 
window. These letters were also rotated in either four different horizontal- 
vertical positions or four different oblique positions. 

The 32 cards were randomized into four blocks of eight stimuli each. The 
four response choices on each stimulus card were also randomized so that 
each of the four nontarget windows presented the correct stimulus match an 
equal number of times. 


Procedure 
Each subject received an individual reading of the instructions and was 
given a demonstration trial before beginning the matching task. The 
instructions were as follows: 
You sit here and I’ll sit next to you and show you how to play a game (put card in 
box). Look at all these letters (point). Now look at this letter (point). Press the red 
button with your finger when you have chosen the letter that looks the same as this 
one (take card out). Now I’m going to show you some more letters. Look at each 
letter and press a red button under the letter that looks just like the letter in this 


window (point). 
If the subject responded correctly on the demonstration trial, the ex- 
perimenter proceeded to give the 32 trials saying each time “which letter looks 
just like this one.” If the subject did not respond correctly to the 
demonstration trial, the instructions were repeated. All subjects responded 
correctly on either the first or second demonstration trial. 

The dependent variable was the number of error responses. To test the 
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distinctive features and schemata hypotheses, a repeated measures design 
with four levels of starting block position and 30 subjects was used. 


Results 
The means and standard deviations for the four block positions are 
presented in Table 1. The higher the score the more ineffective was the 
subjects’ performance on the task. An examination of this table shows that 
block four had the highest mean score. 


TABLE 1 


MEANS AND STANDARD DEVIATIONS? ON ERROR RESPONSES 
ACROSS FOUR PRACTICE BLOCKS 


Practice Block Positions 


i 2 3 4 
Zeit, DANO ZohO oes 
(Gli 3'5)) (1.79) (1.62) (G25) 


a ee ; 
Standard deviations are in parentheses 


Table 2 presents a summary of the repeated measures analysis. The results 
indicated that block position (practice) had no effect on performance, F(1,87) = 
1.42, p > .05. Since there were no practice effects, these results support the 
distinctive features hypothesis as opposed to the schemata hypothesis. A 
further examination of all possible pairs of means was made. Using a 
Neuman-Keuls method as described in Winer (1962, pp. 309-310), it was found 
that there were no significant differences, p > .05. This lends additional 
support to the distinctive features hypothesis. 


TABLE 2 


REPEATED MEASURES ANALYSIS OF VARIANCE WITH FOUR LEVELS 
OF PRACTICE BLOCK POSITION 


Source Ss df MS F 
Blocks 7.57 3 2.52 urge" 
Subjects TANS, Sh0) 29 3291 

Error 1545143 87 lly Ze: 

Total PAT ays SiO) mes) 


* Not significant 


Discussion 
Table 1 indicates that errors increase with practice. The child with 
knowledge of the distinctive features of familiar alphabetical letters made 
fewer errors in the first practice block than in the fourth practice block. 
Repetition did not facilitate learning the unfamiliar, same type transforma- 
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tion. Thus, the results failed to support the schemata hypothesis, and instead, 
supported the distinctive features hypothesis. 


One reason why this study might not have supported the schemata 
hypothesis is that the stimuli were too confusing. Piaget and Inhelder (1969) 
have indicated that four-year-old children do not perform well in tasks 
requiring them to deal with more than one dimension along a stimulus 
variable. This suggests that tasks which include more than one stimulus 
variable may be too confusing to the young child. 


For example, in this study, the child was required to recognize letters and 
their transformations within simple and complex patterns. The large array of 
stimuli may have introduced more information than the child could use in 
terms of matching a prototype to a cognitive model. Practice, therefore, may 
have been ineffective since the children were unable to utilize information 
along several dimensions. However, since the children used in this study gave 
correct answers in approximately 64 percent of the instances, it would tend to 
indicate that the stimuli used were not too difficult. The argument suggesting 
that the stimuli were too confusing would, therefore, not be an appropriate 
explanation as to why the schemata hypothesis was not supported. 


An alternative reason as to why the schemata hypothesis might not have 
been supported concerns the novelty of the stimuli. Gibson (1969) noted that 
practice does not facilitate form perception if the stimuli are not novel. Herein 
lay a difficulty in proving the schemata hypothesis. Lower case Roman 
letters are generally familiar forms within the visual experience of most four- 
year-old children. The unusual aspect of the stimuli was its various 
transformations. These transformations, however, may not have been 
perceived as unusual by the child who does not yet understand why a “d” 
functions differently from a “b.” In this sense, transformations may have 
been ignored. 

The results of this study clearly support the distinctive features 
hypothesis. Since no data is available concerning the novelty aspect of the 
stimuli, this argument, as previously mentioned, provides an alternative 
explanation of the data. Further research investigating these two hypotheses 
should be addressed to this issue. 
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The basis for this research is a theory of rational numbers, which sees the 
rational number construct of a person as potentially built on a number of 
subconstructs. This research explored the view of rational numbers as 
operators as held by children and adolescents. Subjects ranging in age from 8 
to 14 years were faced with six task settings involving operator representations 
of the rational numbers 1/2, 1/3, 2/3, 3/4, 1/2 x 1/3, 1/2 x 3/4. Significant 
changes in performance level and quality appeared between ages 11 and 12 
and particularly between ages 12 and 13. Based on the findings, it was 
hypothesized that three levels of “operator” development exist, and that 
subjects use such constructive mechanisms as partitioning to control the 
operator situation. (Dr. Kieren is a Professor in the Department of Secondary 
Education and Dr. Nelson, a Professor in the Department of Elementary 
Education, at the University of Alberta.) 


The rational numbers (commonly seen in the form of fractions, decimals or 
percents) are of considerable practical importance. They are the numbers 
used to measure or label most continuous quantities and to quantitatively 
compare two or more entities in a multiplicative manner. They are also of 
considerable mathematical significance both in the area of analysis and 
algebra. This importance has been recognized in the significant amount of 
instructional effort placed on rational numbers, instances of which are to be 
found in the curriculum for children aged 9-14. Yet, for all this effort, rational 
numbers or “fractions” have historically and currently proved to be a topic in 
which achievement was lower than anticipated (De Morgan, 1943; Carpenter, 
Coburn, Reys & Wilson, 1975; Ginther, Ng & Begle, 1976). 


Formal instruction with whole numbers has focussed on computation. 
However, it is based upon a large amount of informal and practical 
experience with whole number settings and counting. Instruction with 
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rational numbers has been based almost solely on a computational construct. 
This construct pictures rational numbers as objects of computational 
manipulation. There is little relationship here to a body of experience of 
children or to the broad practical spectrum. Rather, rational number 
computations are pictured as extensions of whole number computations. As 
suggested above, this approach, even carefully carried out, has not developed 
high standards of student achievement. 

The study reported here is based on a much broader construct of rational 
numbers. This construct (Kieren, 1977) has, as its capstone, mature 
functioning in a spectrum of rational number settings. Underlying this 
maturity is a well developed concept of equivalence and a broad base of 
experience in the areas of five basic sub-constructs: 


rational numbers as part-whole relations; 
rational numbers as ratio numbers; 
rational numbers as quotients; 

rational numbers as measures; 

rational numbers as operators. 


In addition, this construct includes mechanisms (e.g., partitioning) by which 
persons deal with the meaning of rational number situations. Many steps are 
required in proceeding from this complex construct to a rational number 
curriculum. It is an assumption of this research that one significant step is an 
analysis of the way in which children and adolescents react to, think about, 
and develop the various sub-constructs. In the case of this study, the operator 
construct was the focus of attention. 


Purpose of the Study 

The purpose of the study reported was to explore the view of rational 
numbers as operators held by children and adolescents. The operator 
construct interprets a rational number as mapping one number to another. 
For example, it interprets the rational number 2/3 as relating 12 and 8, 6 and 
4, 2 and 4/3, etc. This construct leads also to the interpretation of 
multiplication as function composition; e.g., if one reduces a picture to 3/4 size 
then the result by 1/2, the overall result is a 3/8 reduction (3/8 = 3/4 x 1/2). 
The research was directed to several exploratory questions. 

1. Is there a change over age of reaction to operator settings? 

2. Do there appear to be stages in this development? 

3. Will there be difficulties with the concept of inverse in these settings? 

4. Are there differences in reaction to unit as opposed to non-unit fractions 

(1/3 as opposed to 2/3)? 

5. What mechanisms do children and adolescents use in handling these 

settings? 


The Research 


The Instrument 

In order to answer the above questions, a rational number operator setting 
was developed according to the specifications for a good mathematical 
problem developed by Nelson and Kirkpatrick (1975). The setting revolved 
around a machine which made fractional changes in sets of discrete objects. 
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After trials with children and critical suggestions, the following setting 
evolved.! 


The subject is faced with a large colourful box with input and output doors. 
He or she is told that the machine is a packing machine that is programmed 
to work a certain way. They are to try the machine by placing certain 
numbers of objects into the machine and counting the output. After a number 
of trials (up to 6), they are asked to predict the result if they placed certain 
numbers of objects into the machine (5 items). The subject is then asked to 
give the number of objects needed to produce a certain output (inverse, 5 
items). Finally for each task, the subject is asked to explain how the machine 
is operating. 

The following explanation details the dialogue between experimenter (E) 
and subject (S) on the “1/2 machine’, with E saying, “You and I are going to 
pretend this (points to box) is a packing machine. It takes pieces of paper 
(points to small sheets) and packs them in boxes (shows simulated boxes). 
You load paper in here (points to input chute) and packages come out here 
(points to output chute). 

“This machine is controlled by a computer that tells it how to work. 
Pretend you are the manager and walk into the machine room and want to 
find out how the machine is working. No oneis around so you decide to try the 
machine out and see how it works. 


“Why don’t you put in 12 of the sheets?” S counts and does so. Machine 
outputs 6. “Count how many came out.” S counts 6. Repeat the above 
sequences using 16, 10, 8, 18, 6 as input values. 

E now hands S a sheet with IN and OUT columns on it. EF says, ‘“‘Can you 
fill out this sheet telling how many packs came out if you know how many 
sheets are put in?” 

S then fills out the first sheet containing 5 items (all Ss did this without 
difficulty in understanding what was asked). 


E now probes, ‘“‘Why did you say 7 out when 14 went in?” EF chooses several 
examples as deemed necessary. S gives free responses. 


E now asks, “Can you tell how the machine is working?” Again, S givesa 
free response. 


E now asks, ‘“‘Can you fill in this sheet (hands inverse sheet to S)? Now you 
know how many packs came out — here on 1. 10 came out. You have to tell 
how many sheets you have to put in to get 10 packs out.” 

S fills in sheet containing 5 items. E then probes as in the direct situation 
above. 


The student is then told the machine has been reprogrammed (an analogy 
is made to a record player or tape player) and it is stressed that the machine is 
working differently. The protocol above with direct and inverse items is 
repeated. 

In all, the subject could be faced with five direct and five inverse items on 
six tasks. One variable in the tasks was whether the operator involved was a 
unit fraction (1/2, 1/3, 1/2 x 1/3 or 1/6) or a non-unit fraction (3/4, 2/3, 1/2x 
3/4 or 3/8). The second variation in tasks contrasted single operators (1/2, 
1/3, 3/4, 2/3) with composed operators (1/2 x 1/3, 1/2 x 3/4). In the latter case 
the subject was faced with two machines. The first, already familiar, put 
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sheets into packs. The second took the output of the first machine (packs) as 
input. It then put out simulated boxes of packs. These composition machines 
were presented using the above protocol illustrating the entire functioning of 
the machines in sequence. Thus the subject placed 16 sheets in the “1/2” 
machine and saw 8 packs come out and be placed in the “3/4” machine from 
which 6 boxes emerged. The subject after up to 6 trials with feedback had to 
predict the action of the composed machines. 


These tasks were presented to subjects individually, with each subject 
allowed as much time as needed to complete the task. Each subject started 
with a “1/2 machine” and proceeded until he or she was unable to make 
predictions or understand how a machine functioned (by the subject’s own 
statement). The subject was then briefly taught a partitioning strategy for 
running the machine (2 teaching trials). The experimenter and the subject 
reversed roles and the subject “ran” the machine. In this way, the subject’s 
susceptibility to instruction on operators was explored. 


Each subject recorded his own predictions on the direct and inverse items 
on attempted tasks, after first receiving up to 6 trials with manipulations and 
feedback. In addition the experimenter made a written record of the students’ 
descriptions and discussions of the machine functioning. 


In terms of being a viable problem, the machine setting proved successful. 
All of the children and adolescents tested reacted to and functioned within the 
setting. 


The Sample 


The choice of the sample in this study was influenced by theoretical and 
practical considerations. Since the current curriculum in rational numbers 
falls mainly between grades 4 and 8, a sample across those grade levels has 
some practical importance. This grade span reflecting an age span of 9-14 is 
appropriate for theoretical reasons as well. By age 9, the ability to make basic 
fractional recognitions should be present (Piaget, Inhelder & Syzminska, 
1966), as should a minimal competence with reversibility which in turn would 
allow a child to function with the “inverse” situations. Because the operator 
construct implicity involves ratio ideas, the relationship to the development 
of the proportionality schema seems important. Lovell (1971) suggests this 
development probably occurs between the ages of 12 and 14. Thus, it was 
necessary for the sample to include children up to that age level. 


The study was conducted in an elementary (Grades 4-6) and junior high 
school (Grades 7-8) in Walton County, Georgia. The students came from rural 
areas and a small city. A random sample of 45 students was drawn, stratified 
for research and practical reasons. With the exception of Grade 4 with 5 
subjects, there were 10 subjects drawn at each grade level, 5 males and 5 
females. Further the sample was selected so that no more than two subjects 
came from any one classroom, this being done to minimize disturbance to 
teachers and classrooms. The age range of the sample was 8 years, 11 months 
to 14 years, 7 months. 


Procedures 
Each subject was interviewed individually in a room provided in each 
school for the purpose. Within schools the interview occurred in random order 
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with the time of each interview coinciding with the needs of the subject. The 
actual times ranged from 17 to 69 minutes with a median time of 44 minutes. 


Each interview followed a set protocol, adapted to insure clarity for each 
subject. Following an introduction and development of the problem, 
sequences of trials with feedback, prediction trials and descriptive requests 
were repeated for each of the six tasks. This was always started with the 
1/2” task followed byr*2/3 2017370" 143ex 1142 yeand 53/495. 2 tasks. 
Because of performance, all students did not attempt all tasks; for example, 
subjects who did not have success on the “3/4” task did not attempt the “3/4 x 
1/2” task. The minimum number of tasks attempted was 1 with the median 
number being 4. 


Analyses 


In order to answer the research questions posed above, the following 
analysis was carried out. The sample was split into four age groups as seen in 
Table 1 below. A one way analysis of variance with orthogonal contrasts was 
carried out to test the following hypothesis. 


Hypothesis 1. There are no significant differences among the mean 
number of items correct on the operator tasks for subjects in the four age 
groups. 

A nominal analysis was used to help determine if stages in operator 
thinking were observed. 

Two tests of independence were conducted to test the following 
hypotheses. 

Hypothesis 2. Performance on direct items are independent of performance 
on related inverse items. (e.g., 1/2 direct, 1/2 inverse). 

Hypothesis 3. Performance on unit items is independent of performance on 
related non-unit items (e.g., 1/3, 2/3). 


Results 
Instrument 


The content and construct validity of the instrument has been discussed 
above and was not formally tested. The reliability of the instrument was 
ascertained using the split half method and was found to be .97. 


TABLE 1 
MEANS AND STANDARD DEVIATIONS ON TASK ITEMS 
OF FOUR AGE GROUPS 
Age Level 
Slt Ibs os <a Ta Si ol) Soper 
N 1, 10 atl 12 
Mean 9.4 Ilbye ais 28.9% Shilie 3) 
s.d ao] ed Wehic, Il 16.9 
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Age Level Analysis 


Table 1 below contains the age level statistics. Hypothesis 1 is rejected 
(F'3,41 = 6.69, p < .01). Because the group sizes are nearly equal, the analysis 
was held to be robust even though there were differences in the variance. 
Constructs revealed significant differences (p < .02) between subjects 
younger than 11 and those in the 11- to 12-year-old group and between that 
group and the 12- to 13-year-old group. 


Nominal Analysis 


The numerical differences reported above are supported by differences in 
the qualities of behaviour at different age levels. To get at this, a careful 
analysis of the kinds of tasks mastered (4 of 5 items) by each subject was 
made. From this and from student transcripts, six descriptive categories were 
developed. These and tallies at each age level are presented in Table 2. 


TABLE 2 
PERFORMANCE CATEGORIES OF SUBJECTS FOR FOUR AGE LEVELS 


vrai) 2 Whe wu do Bese ec Oiee, my 4 


VI Mastered all items 
using proportions 


V Mastered unit, non-unit 


— xxx xx 
and composition 
IV Mastered unit and 4 oe ee 
composition 
ICAL Mastered unit tasks x xx xx xxx 
IL Mastered “4:" tasks Xxx x 
only XXXKX XXXXX XxKX xx 
Jk Mastered no tasks Xxx x 


a 
y = years of age 


Some of the categories require further explanation. Category II refers to 
the direct and inverse tasks using the “1/2” machine. Category III concerns 
“1/2” and “1/3” tasks, involving fractions with unit numerators. Category 
IV again involves the unit tasks of III but also includes the composition task 
“1/3 x 1/2”. Categories V and VI involve mastery of direct and inverse 
questions on all six tasks. Subjects in the former category solved the problems 
using numerical comparisons, while those in category VI would consider the 
operator as generating a proportion. 

At each age level, performance ranged over several performance 
categories and there was overlapping of performance among all age levels. 
However, of subjects over 12, 57 percent are at level 4 or above while only 5 
percent of subjects under 12 achieved that level. 
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Analysis of Problem Areas 


By the design of the tasks, a comparison of performance on direct and 
inverse tasks and unit and non-unit tasks was possible. Rather than make a 
gross numerical comparison, performance on task pairs by individuals was 
considered. These data are presented in Tables 3 and 4. 


TABLE 3 
COMPARISON OF PERFORMANCE ON PAIRED DIRECT-INVERSE ITEMS 
Inverse 
Yes No 
Yes 81 113} 
Direct 
No al 58 
x2: IOVS EG Foy BS slOnil 
TABLE 4 
COMPARISON OF PERFORMANCE ON PAIRED UNIT AND NON-UNIT TASKS 
Non-Unit 
Yes No 
Yes 15 Suk 
Unit 
No 1 Sy 


4° Jhb: OL < 7,08 


There is a significant relationship between performances on the two task 
types in both tables. However, for the direct-inverse comparison most 
students were either correct or incorrect on both items of a pair, while for the 
unit—non-unit comparison, almost half of the students got the unit task while 
missing the paired non-unit task. These relationships were similar at all of 
the age levels with, of course, the ‘‘yes-yes” tally increasing with age in both 
comparisons. 


An analysis of the verbal descriptions of subjects of the way in which the 
machine functioned or their verbal response to items revealed two important 
trends. The first, which has been seen in other proportionality research, is 
that students thought subtractively and not multiplicatively in analyzing the 
problem settings. This was particularly so for students under 12 years of 
age. For example, in looking at the “2/3” operator such a student would say 
‘it’s subtracting 4” (12 > 8), “it’s subtracting 10” (30 — 20) and thus never 
focus on the constant multiplier involved. 


The second important feature, was the role “1/2” appeared to play in the 
subjects’ thinking. As can be seen from Table 2, over 91 percent of the subjects 
mastered the “1/2” task. Further, although students stated that they knew a 
machine was not a “1/2” machine, when confused they would give a “1/2” 
response. For example, with an input like 24 on the “3/4” machine, the 
student would predict 12 as an output. This kind of response was made by 47 
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percent of the entire sample and was particularly evident for students under 
12, where 63 percent of the students made this kind of response. 


As suggested earlier, a very tentative exploration was made of the 
subjects’ ability to learn quickly how to function with an operator (on the first 
task over which they did not show mastery). Of the sample, 27 subjects 
showed this ability, with all students grade 6 or above who were given brief 
instruction, learning how to execute the operator. Below grade 6 only 6 of 15 
students were able to learn how to execute the operator after the brief 
instructional sequence. In all cases tested, this “learning” only held for the 
direct aspect of the operator involved with only a very limited carry-over to 
the inverse or other tasks. 


Discussion 


The results taken from the data above indicate a strong age and stage 
development of students’ reactions to the operator interpretation for rational 
numbers as given in this study. While, of course, this could be an artifact of 
the task design, it is probably more closely related to the onset of formal 
operative thinking. The performance levels indicated by V and VI in Table 2 
require that the student simultaneously compare two phenomena and inter- 
relate them. This performance is comparable to the logical thinking described 
in several of the experiments in The Growth of Logical Thinking from 
Childhood to Adolescence (Inhelder & Piaget, 1958). 

Taken in this light one can hypothesize three levels of rational operator 
development. First, there is the “‘1/2” oriented level where a child’s fractional 
conceptualization is dominated by “1/2”. The second level is transitional and 
linear in nature in which the subject can handle units and compositions of 
unit operators. The third level is ability to handle compositions with all forms 
of operators. Although not observed here, there is probably a level 
incorporating operators into a system which is beyond those considered here. 


A study of the interview transcripts reveals that thinking of operators as 
ratios is not necessary for achieving at the third level discussed above. In 
fact, only one student, a 13-year-old girl, indicated use of proportions in 
analyzing the machines. She was also the only one who named the 
composition of “3/4” and “1/2”, “3/8” after one trial. Other subjects appeared 
to use a partitioning mechanism to get at non-unit operators. For example, 
another 13-year-old girl in looking at a 12 input, 9 output situation said “2 into 
12 is 6, but 2 doesn’t work with 9, 3 into 12 is 4, 3 into 9 is 3 — oh, I see” and 
then she applied this to prediction tasks by dividing by 4 and multiplying by 
3. 

The research discussed above was exploratory and the data gathered from 
a small sample of subjects. Yet it would appear that in building curriculum for 
the operator construct, development cannot safely be ignored. It would also 
appear that early instruction building the child’s concept of fraction beyond 
“1/2” and instruction in various partitioning tasks would be fruitful. Further, 
because of the variety of performance levels at each age level, individualiza- 
tion in terms of both goals of instruction and student strategies (partitioning 
vs. proportionality) in the operator construct appears to be necessary. 


* This study was undertaken under the auspices of a sabbatical leave grant from the University of Alberta and with 
administrative assistance from the Center for the Study of Mathematics Learning and Teaching at the University 


of Georgia. 
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Note 


1Dr. I. Weinsweig, University of Illinois-Chicago Circle and Dr. K. Fusen, 
Northwestern University provided helpful criticism. 
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Effects of Play on Novel Responses 
in Kindergarten Children* 


One hundred and twenty kindergarten children were randomly assigned to 
four treatment conditions: free play, make-believe play, imitation, and control. 
They were then given an alternate-uses test and the objects were: a paper towel, 
a matchbox, a paper clip, and a screwdriver. Subjects in both the free play and 
make-believe play conditions produced significantly more nonstandard 
responses for paper clip than control subjects. Make-believe play subjects 
produced significantly more nonstandard responses than control and free play 
subjects for screwdriver, the only object not present in the treatment 
conditions, suggesting a carry-over effect of make-believe attitude or 
playfulness. (Dr. Li is Associate Professor in the Department of Educational 
Psychology at the University of Calgary.) 


There is tentative evidence for the view that play is associated with the 
emergence of novel responses, responses which may be employed to good 
effect on creativity tests (Sutton-Smith, 1973). Recent studies giving support 
to this hypothesis include those by Lieberman (1965), Sutton-Smith (1967), 
and Dansky and Silverman (1973). While Lieberman (1965) found a 
significant relationship between playfulness and divergent thinking in 
kindergarten children, because intelligence loads heavily on her playfulness 
scales, the exact relationship between playfulness and divergent thinking 
cannot be determined. 

Dansky and Silverman (1973) examined the differential outcomes of 
playful and imitative activity and found that children in the play condition 
gave significantly more nonstandard uses for the objects on an alternate-uses 
test than children who used the objects in the imitative context or children 
without prior exposure to the objects. However, one cannot be certain in what 
way the effects of the play treatment were transferred to performance on the 
alternate-uses task, for example, whether the play subjects might have 
brought a more playful attitude to the alternate-uses task than the imitation 


and the control subjects. : 
The present study was undertaken to extend Dansky and Silverman’s 1973 
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study, to test the hypothesis that a playful attitude may result in a general 
lowering of a person’s threshold for novel associations, by including in the 
alternate-uses task an object not present during the play or the imitation 
condition. The Dansky and Silverman study employed only one play 
treatment condition, which may be described as free play, for the subjects in 
the play group were told, “You may play with all of these things. Do whatever 
you would like to do with them.” In other words, the subjects are provided a 
free play situation, with freedom to engage in spontaneous play which may or 
may not involve ‘make-believe’ elements. Qualitatively, there are several 
types or kinds of play, and the term “play” in Dansky and Silverman’s play 
condition is probably used in the generic sense. Singer (1973) believes that 
children do not automatically learn how to play in a make-believe way, even 
though a certain amount of make-believe develops from the intrinsic nature of 
childhood growth. Furthermore, it is the make-believe aspect of play which 
may be associated with divergent thinking. Therefore, another purpose of this 
study was to focus on the make-believe aspect of play and to explore the 
differential effect on children’s novel responses of two types of instructions 
for introducing the play treatment. One may be described as providing a free 
play situation while the other attempts to orient the child towards “‘pretend”’ 
or make-believe play. 


Method 

Subjects 

The sample consisted of 120 children, 60 boys and 60 girls, aged between 4 
years 11 months and 6 years 1 month. They were randomly selected from 10 
community kindergartens which had been randomly chosen from all the 
community kindergartens in Calgary, Alberta. By community kindergartens 
is meant community-based programmes operated by community 
associations, as opposed to board-based ones, operated by the school boards. 
At the time of the study there were many more community-based programmes 
than board-based ones. 


Subjects were randomly assigned to each of four treatment conditions: free 
play, make-believe play, imitation, and control (N = 30 in each condition, half 
boys and half girls). 


Materials 


For the free play and make-believe play conditions, the materials were a 
pile of 10 paper towels, 10 empty matchboxes and a pile of 30 paper clips. For 
the imitation condition, 10 paper towels, 10 empty match boxes, and 30 paper 
clips were used, in addition to 6 wet plastic cups, a bunch of toothpicks, and 15 
blank cards (3 x 5 inches). For the control group, the materials consisted of a 
box of colouring pencils and 4 sketches. A coloured screen was placed on the 
table facing the subject during testing. 


Procedure 

The procedure was similar to that used by Dansky and Silverman (1973). 
All subjects were seen individually twice by a female graduate student: once 
for an initial rapport-building 5-minute colouring session and then a week 
later during the experiment proper. 
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For the second session, free play subjects were presented with the stimulus 
materials and told, “You may play with all of these things. Do whatever you 
would like to do with them.” The free play period lasted 10 minutes. The 
subject could use the objects as he chose but if he disregarded any of the 3 
objects, one or two prods were given in the fourth or seventh minute of the 
session, by naming the objects the subject had not used and indicating they 
could be played with as well. 


Subjects in the make-believe play condition were told at the start of the 
second session the following story: “This is a story about a very, very clean 
lady. Once there was a very, very clean lady who scrubbed and scrubbed her 
kitchen table so hard that she wore a hole in the middle of the table. She 
scrubbed and scrubbed her clothes so hard that they fell apart. She scrubbed 
and scrubbed her face so hard that .... guess what... . her nose fell off! Can 
you imagine that? .... Do you like playing make-believe?” Then the subject 
was presented with the stimulus material and told, “Let’s make believe or 
imagine that an object could become anything you would like it to be. Play 
with all of these things. Do whatever you would like to do with them.” The 
play period lasted 10 minutes altogether, including the story-telling. Again, if 
the subject disregarded any of the objects, one or two prods might be given as 
in the free play condition. 

Subjects in the imitation condition were asked to watch the experimenter 
while she performed three tasks (fastening any three cards with a paper clip, 
three times; wiping all six cups with a paper towel; and putting a few 
toothpicks in an empty matchbox, doing it three times). The tasks were 
performed in random order. Then the subject was told, “Now you do it in 
exactly the same way that I have done with these things.” 

The control subjects were given a box of colouring pencils and four 
sketches to colour as they liked. 

The four treatments were administered in blocks of four. Within each 
block, the first subject seen was always from the free play group, followed by 
one from the make-believe play group, then imitation and lastly, the control 
group. Each treatment lasted 10 minutes, and the amount of time the subject 
spent in playing with each of the three objects was recorded. 


The Dependent Measure 

An alternate-uses test was employed as a measure of associative fluency. 
The four experimental objects for the test were: a paper towel, a matchbox, a 
paper clip, plus a screwdriver, which was not used in any of the treatment 
conditions. The test was given to each of the subjects by the same 
experimenter immediately after the treatment condition. The experimenter 
randomly selected one of the four objects and showed it to the subject, saying, 
“You can use a in lots of different ways. I would like you to tell me all of 
the things that you could do with it, make with it, or use it for.” Once the 
subject stopped responding, he was asked, “Could you do anything else with 
it?” The subject was allowed to see, but not handle, the objects during 
questioning period. The same procedure was repeated for each of the four 
objects. The amount of time taken by each subject to complete the alternate- 


uses task was recorded. 
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Results 


All responses were recorded on audio tape and were scored by two 
independent judges as either standard or nonstandard according to a method 
used by Goodnow (1969). Novel responses would be the nonstandard 
responses. Neither of the judges knew the subjects’ treatment grouping while 
doing the rating. The judges agreed on 97% of the total 2,070 ratings. 


Standard and nonstandard responses were analyzed separately. Mul- 
tivariate analysis of covariance performed on the standard scores did not 
yield any significant treatment effect, which was expected. But there was a 
significant sex effect in that the male subjects (mean = 2.17) produced more 
standard responses than female subjects (mean = 1.43) for the object paper 
clip, F(4,109) = 7.73, p < .01. There was no significant treatment x sex 
interaction effect. 


For nonstandard responses, no significant sex difference was found. Thus 
the sexes were combined for subsequent analyses (N = 30 in each condition). 
One-way MANOVA performed on the nonstandard scores yielded a 
significant overall treatment effect. Further inspection of the data showed a 
significant treatment effect for paper clip, F'(12,299) = 3.27, p < .02, and for the 
screwdriver F'(12,299) = 2.88, p < .04. Multiple ¢ tests indicated that for paper 
clip, subjects in make-believe play condition produced significantly more 
nonstandard responses than subjects in the control condition (p< .03), while 
free play subjects produced significantly more nonstandard responses than 
both the control subjects (p < .01) and the imitation subjects (p < .04). For 
screwdriver, make-believe play subjects produced significantly more nonstan- 
dard responses than either the free play (p < .01) or the control subjects (p < 
.05). The mean number of nonstandard responses to each of the four objects 
for each of the four conditions is presented in Table 1. Analysis of the amount 
of time taken by subjects to complete the alternate-uses task showed no 
significant difference between groups or sexes. 


TABLE 1 
MEAN NUMBER OF NONSTANDARD RESPONSES FOR ALL OBJECTS 


Screwdriver Paper Clip Paper Towel Matchbox 
Make-believe play 13539 G25100) 1s 939 e227) B306— Gri) Zmose #C2.05) 
Free play Os45 08 CO 7) PESO APRS Shee CCsiae259) 2a SOL B24.) 
Imitation O89 TE92) RG a KC5 3) 226 ae (22) 193 GSS) 
Control Wavsh » COREE) O290 5.30) 2100 (268) 1.80 (2.40) 


Note. Standard deviations are given in brackets. 


Discussion 
Analysis of the data yielded a significant treatment effect for nonstandard 
responses for screwdriver in favour of the make-believe play subjects, and for 
nonstandard responses for paper clip in favour of both the free play and 
make-believe play subjects. With regard to standard responses, no treatment 
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whee was found but a significant sex effect was found for paper clip in favour 
of boys. 


The results of this study lend some support to the hypothesis that a playful 
attitude facilitates associative fluency. The findings of a significant 
treatment effect for screwdriver in favour of make-believe play subjects is of 
importance. Since none of the children had played with or handled the 
screwdriver during the treatment session, the responses to the alternate-uses 
test for this object might have been affected by the child’s frame of mind or 
attitude during treatment. The introduction of make-believe elements in the 
instructions seems to have greater facilitating effect than just asking the 
subjects to play with the objects. One might infer a possible transfer effect of 
a make-believe or imaginative playfulness. The results are consistent with 
that of a recent follow-up study by Dansky and Silverman (1975) which used 
objects not involved in any of the treatment conditions for the alternate-uses 
task. An intellectual task condition was added to the play and the imitation 
conditions. The only variable differentiating the three groups was the nature 
of the activity engaged in before the subjects were given the alternate-uses 
task. They found that the subjects in the play condition produced significant- 
ly more standard and nonstandard uses than did the subjects in either the 
imitation or intellectual task condition. 


The unexpected finding of a significant sex effect for standard responses 
for paper clip is somewhat difficult to explain. A visual inspection of the mean 
standard uses for this object shows that boys give more standard responses 
than girls for every one of the four conditions. But an inspection of the 
nonstandard scores shows that while there was no significant sex effect, the 
boys’ mean for nonstandard uses is higher than that of girls for the free play 
condition, but the reverse is seen for the make-believe play condition. This, 
together with the significant treatment effect for nonstandard responses, 
suggests that instructions introducing make-believe elements into the play 
condition may have greater facilitating effect for girls than for boys in terms 
of generating nonstandard responses. However, the need for caution in such 
an explanation is recognized. In summary, the present findings can be said to 
add to a growing literature supporting a relationship between play and 
divergent thinking. However, to determine more exactly the differential 
effects of free play and make-believe play on associative fluency, a more 
powerful design would be needed. 


*This study was supported by a University Research Grant from the University of Calgary. 
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Planters and Pedagogues: The Social and 
Political Implications of Education 
in British West Indian Slave Society 


This article utilizes the findings of a far-reaching and detailed analysis of 
evangelical missionary education in the British West Indies during the years of 
slavery and apprenticeship, 1800-1838. The archives of four London-based 
missionary societies, as well as primary source materials from other 
organizations, provide the basis for the argument. Two contending groups, 
planters and missionaries, proved to have antithetical assumptions about the 
provision of education to slaves. Missionaries in this case did not always 
operate in the interests of the dominant white power group and neither did they 
unequivocally inculcate traditional values which reinforced the existing social 
order. (Dr. Rooke is a Sessional Instructor in the Department of Educational 
Foundations, The University of Alberta.) 


The path to educating, even christianizing, another man’s “property”’ is 
fraught with difficulties, for the common presumption in a slave society is 
that “knowledge and slavery are incompatible” (Whiteley, p. 13; Sydnor, p. 
53). Just as the whip is indispensible to slavery so is ignorance necessary for 
subservience. For one group to maintain absolute tyranny over the persons, 
production, and reproduction of another group, they must also possess the 
means of preserving that power. Such preservation included a monopoly over 
law and education. 

Until 1823, when the official Church of England in the colonies was 
directed under Act of British Parliament to increase religious instruction to 
bondsmen, custom and sanctions against such instruction had minimized 
any opportunities. The slave laws of the American South had expressly 
forbidden any education to slaves. In the case of the sugar colonies, a similar 
preclusion modified only as public opinion increased in Britain in favour of 
slave emancipation. With this changing political climate both planters and 
clergy came under attack for their neglect and disinterest in converting 
slaves. Abolitionists, evangelical churchmen, liberals and missionaries alike 
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were committed to the belief that a moral and religious preparation for 
freedom was imperative. 


The planters were disgruntled at the public debate on gradual emancipa- 
tion but appeared amenable to the British directives. However, this was an 
insincere attempt to stem the mounting tide of criticism against the excesses 
and abuses of the plantation system. At no time did their sudden and 
expedient tolerance include any intention of allowing the teaching of reading 
or writing as part of the religious instruction. These remained clearly 
prohibited. In addition to the planter preference that strictly religious 
principles be inculcated by oral means only, they preferred that any such 
instruction be given by Church of England clergy. For example, as late as 
1823, when Parliament was debating the Melioration bill, the planters and 
official church clergy in Barbados were still enjoining missionaries that their 
methods must remain “strictly oral” (Fulham Papers). The Colonist in the 
same year was insistent that if slaves must have religious instruction then it 
must come from “regular bred clergymen” and not from missionaries. It 
believed evangelical preachers found it impossible to preach “‘the pure word of 
God, unpolluted with sedition.” 


The dominant white class in the British West Indies viewed evangelical 
missionary education with profound suspicion, and literacy as the initiation 
of the negro population into subversion. 


It remains to ask several questions regarding missionary education to 
slaves. (1) In which ways did it reinforce the prevailing assumptions of slave 
society? (2) To what extent were missionaries perceived as being conservers of 
the existing school order? (3) Did the missionary group identify themselves as 
part of the dominant white power group? (4) What was the response to 
missionary education? 


Missionaries and the Prevailing Assumptions 
in Slave Society 


The first question is perhaps more crucial than the others for in many 
ways the answer to the others rests on the first case. It is apparent that 
“education” of any sort does not reinforce the prevailing assumptions either 
about slave societies, or about slaves. It is, indeed the very antithesis to them. 
Today we are often tempted to dismiss missionary education as mere 
“indoctrination.” It is quite certain, however, that planters were unwilling to 
see it so simply. They found it impossible to dismiss and saw whatever was 
going on in the chapels and schoolrooms as distinctly problematic. Had only 
those Christian precepts such as obedience, submission, and gratitude been 
indoctrinated, missionaries would have been both applauded and welcomed. 
Instead, they were detested and subjected to hostile social pressure. Once 
reading and writing were introduced, then clearly the planters viewed 
something as occurring which was more than mere indoctrination. 

Richard Watson, Secretary of the Wesleyan Methodist Missionary Society 
(WMMS), shrewdly observed that the white class, knowing “that knowledge 
was power,” could not “conceive of how this power could be communicated” 
without endangering themselves. Thus, “the prejudice against schools [was] 
much stronger than against preaching” (1817, p. 9). Mr. Raynor, a Methodist 
in Antigua, agreed. “Nothing can prevent the Creole Negro from gaining that 
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power while he has intercourse with those who possess it” (Item 31, 1814). The 
missionaries were not only preachers but also pedagogues who taught 
systematically, deliberately, and according to the common methods of their 
day. Although they trusted much to the power of the Holy Spirit, they did not 
rely entirely upon it to convert slaves! Their content was predominantly 
scriptural but it is known to have included reading and writing and, on 
occasion, history, geography, arithmetic, and natural science. 


In the light of those constraints placed upon teaching literacy skills it 
came about that missionaries most strongly objected to slavery itself. Slavery 
interfered with teaching slaves to read, and reading the scriptures was 
deemed imperative to Evangelical Protestantism. Scripture reading was seen 
as a prerequisite for conversion. Therefore, missionaries strenuously urged 
the planters to lift the prohibitions placed upon teaching literacy. However, 
the only methods of instruction “remotely approved” of by the owners of 
slaves were either oral means or by catechising. It might seem that the 
injunctions on catechising would be satisfactorily received by missionaries; 
in fact this was not the case. Generally they agreed with a description of 
catechising published in the Anti-Slavery Reporter of 1831. “Catechising”’ 
was agreeable to the planters only because it was “‘a most ingenious device for 
training teachers to teach nothing” (p. 472). Missionaries such as John Wray 
thought that catechising was a useful method only if used as an instructional 
aid. They had grave reservations about using catechising as the sole method 
of transmitting religious precepts and considered it an inadequate method for 
conveying other subjects. Furthermore, although Wray’s licence was under 
“strict injuction” not to introduce literacy skills, he cheerfully ignored such 
an order, as did most of the other missionaries. 


If it is somewhat surprising to find missionaries reluctant to catechise 
solely, it comes as a further surprise to see the methods they did prefer. By no 
means did these men only preach the gospel or convey their message through 
exhortation and moral suasion. By far, they preferred to use the familiar and 
carefully systemized pedagogical methods of their day. These consisted of the 
Lancastrian or British System. In 1809, John Davies of the LMS had already 
printed his own series of lessons based on those of the British and Foreign 
Schools Society (BFSS). These lessons were adapted to the children “of the 
Black and Coloured Population.” In addition to these, the London Missionary 
Society (LMS) men in British Guiana used the Infant Method which was 
particularly innovative even in Britain itself. As this method had been barely 
introduced to the Mother Country in the twenties, the dissemination of its 
principles into Berbice and Demerara in the early thirties is noteworthy. 
Between 1830 and 1834 most LMS men were proving themselves advanced 
pedagogically by using a method, the Infant Method, based upon Pestaloz- 
zian object lessons and sense-data principles, the use of illustrative teaching 
aids such as maps, charts, and pictures, as well as the encouragement of 
rhythmic exercises. 

A basic assumption of missionaries was that all men were equal in the 
sight of God, therefore it was necessary to bring all men out of ignorance so 
they could either choose or reject Christianity. For slave owners such an 
assumption was objectionable because their economic and social system 
operated on the opposite assumption — that of the basic inequality of men. To 
teach a slave that he was spiritually equal to a white man was seen as the 
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first step to having him wonder why it was that some men could own another 

group of men. This sentiment was aptly expressed in The Colonist, 1823. 
For a WHITE man and a minister to take a negro slave by the hand, and ask him 
how he does is a compliment which he seldom meets with and can very rarely 
expect; to address a promiscuous audience of black and coloured people, bond and 
free, Christians and heathens, by the endearing appellation of ‘““My brothers and 
sisters” is what can be nowhere heard except in Providence Chapel, or peraventure 
Ebenezer Chapel. 


Planters had always been concerned about the “consequences” of 
christianizing slaves, as a statement in the Royal Gazette of 1808 indicates. 
“Will not the negro conceive that by baptism .. . he is as creditable as his 
Christian white brethren?” Thus we can understand planter resentment of a 
group of white men who obviously believed otherwise about the nature of 
negroes and slaves. Missionaries saw the slave neither as noble, as advocates 
of the “Noble Savage” would depict him, nor as brutish, as others would make 
him. Therefore they sought to christianize slaves because Christianity was 
believed to be synonymous with civilization. Here was yet another 
assumption in disagreement with those prevailing in slave society. Planters 
rejected ideas of civilizing slaves for they were seen as “wild, savage, and 
barbarous in the extreme.” Civilization implied capacity for some refined 
learning and intellectual proficiency, neither of which the missionaries 
doubted negroes were capable of. Planters were convinced that only “the iron 
rule of harsh authority” could restrain their “unexampled atrocities.” That 
was why their slave laws were “rigid and inclement” (Bridges, The Annals). 
Missionaries insisted that the opposite was the case. Christianity would, said 
the Secretary of the Wesleyan Methodist Missionary Society (WMMS), 
“modify violent passions, and reform laws.” It would “civilize, cultivate 
knowledge, morality, and domestic habits” (Watson, 1824, pp. 13-17). As 
James Stephen had observed in Slavery in the British West Indian Colonies 
(1824), ‘““Mere interaction with the white race” would not civilize slaves but 
only “religious instruction could” (p. 202). Those heathens, previously 
“incapable of culture,” must be brought to civilization by stressing the 
lessons of morality and piety (Observations, 1827). 


There is much to be said concerning the racial attitudes of missionaries in 
the West Indies. We might not fully agree with O. W. Furley (1965) that they 
were “pioneers of a new racial society.”” However, neither can we deny that, 
for want of a more historical term, there is sufficient evidence that on the 
matter of race, missionaries “were ahead of their time.” Certainly their 
attitudes towards matters of race were based on distinctly different 
assumptions than those of the broader society. Although an essay on this 
aspect could be documented, let it suffice to observe that there is that 
evidence. There is, however, also evidence demonstrating a subtle but 
discernible shift in their racial attitudes during apppenticeship, 1834-38. A 
niggling ambivalence which can be observed during slavery finally asserted 
itself during this crucial period as another sad commentary on the tensions 
between the theories and practices of many idealogues, religious or secular. 
Nevertheless, planter perceptions of missionaries included notions of a racial 
egalitarianism their society could neither understand nor withstand. In a 
world whose structures and conventions seemed crumbling around them, 
they could not distinguish spiritual impartiality from socio-political equality. 
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They could not construe where fine words such as Knibb’s might lead. “I look 
upon the matter of slavery,” he said before a meeting at Exeter Hall, August 
15, 1832, “as one of religion and morality. All I ask is that my African brother 
may stand in the family of man” (Religious Persecution, p. 9). Planters’ fears 
were in some ways exaggerated and unwarranted. Almost certainly the 
missionaries were men enough of their time and culture not to believea negro, 
slave or freed, was actually as good as an Englishman. The words of a sermon 
in 1817 probably typified most of our missionary samples on the matter of 
absolute equality. 

Who would exchange the general character of Britons for the cozenage and perfidy 

of the Chinese, for the servility and duplicity of Hindoos, or for the brutish ferocity 

of American savages? (Harris, p. 125) 


During apprenticeship, for example, a coloured catechist, William Trotz 
was described as his “piety” being “fine,” but his “proficiency [was] limited.” 
Another coloured catechist was definitely kept in his place by the Director of 
the LMS. The Director sanctimoniously reminded the catechist, who asked to 
be ordained a minister, that God’s wisdom had created “the various orders of 
men” (Haywood, 1839; Ellis, 1838). Naturally the coloured catechist was 
expected to be among the least significant of these orders. Nevertheless 
during slavery such reservations were less commonly made public. Indeed 
they are unusual to find even in the private sentiments of missionary 
correspondence. The words of John Armstrong of the CMS were more typical. 
He exclaimed that slaves were not ‘“‘a set of stupid automatons which can 
never learn nor are fit to be loose” (1830). Mr. Raynor of the WMMS had 
claimed some years before that many people of colour had “more correct 
knowledge of men and things as well as doctrines and language than many of 
our preachers at home and are much more ready to notice defects in the above 
particulars than the greater part of our regular hearers in England” (Item 31). 


In slave society, a slave was listed in an inventory, below buildings and 
land, and just above the livestock (Walker, p. 159). The white classes justified 
their dominant position on pseudo-scientific theories which graded the slave 
just above an “orang-outang” or of a separate genus. Some theology justified 
the condition of slavery based on colour because negroes were seen as the 
descendants of Ham, Cush, and Put. In such a society missionary education 
takes on added significance and can be seen in terms other than mere 
proselytization. It was a visible and defiant contradiction of many of the 
society’s prevailing assumptions. When missionaries introduced literacy 
skills into their schoolrooms it was no longer just a contradiction. It was seen 
by the dominant white society as an insolent threat to social order. 


Missionaries as Conservers of the 
Existing Social Order 
Any education which postulated assumptions about the moral, intellec- 
tual, and spiritual equality of slaves and free white men cannot be seen as 
conserving the existing social order which presupposed the very opposite. As 
religious instruction implies at least rudimentary introduction to formal 
learning, we come back to that missionary assumption which reveals the 
main source of contention between missionaries and planters. That 
assumption asserted slaves could be civilized and must be Christianized. 
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Planters feared that learning might lead to “uppity” or cynical slaves. If 
reading intruded, this might well lead to civil unrest, for who knew what 
might be read and by whom, or for what purposes? (Waddell, pp. 17, 34; Wray, 
1813; Smith, 1818). Under the circumstances it cannot confidently be asserted 
that missionary education was either conserving the existing society or 
perceived to be by those whose interests were best served by conserving it. 


Had missionaries merely believed that slaves were equal to whites in the 
sight of God, few planters would have been unduly disturbed. However, these 
zealots did not only believe this but they acted upon such beliefs. Missionaries 
built schools and chapels, nursed the sick, cared for the aged, instructed the 
youth, socialized with the meanest slave, and lived in close proximity to the 
most contemptible stratum of society. They “ingratiated themselves with the 
lowest class’ (Van Cooten, 1815). Understandably, even if quite mistakenly, 
such acts of charity were thought to be signs of political radicalism. Although 
we, as latter day observers of past events, might smile a little at the prospect 
of political radicalism among nineteenth century evangelical missionaries, 
we must not be too hasty or too supercilious. Let us then examine the 
evidence. 


The missionary record on the subject of slavery is not as negligent as 
might be suspected. As early as 1811, Wray returned to England to complain 
about conditions on plantations. Many missionaries lodged official com- 
plaints about conditions and treatment of slaves and about abuses by 
particular managers, attorneys, or planters. Several published exposés in the 
British press and were themselves tried for libel and slander by those planters 
they exposed. Breaches of trust over the rights of slaves to practise 
Christianity were brought before the Colonial Office by missionaries 
(Reckord, 1971). William Knibb was rumoured in the British and Colonial 
press as contributing his and his Church’s leadership to the Jamaica 
Rebellion in 1831. Although the rumour was unfounded, the militancy of some 
missionaries in this period was not. Knibb and another Baptist, Thomas 
Burchell, campaigned against slavery in England before Emancipation came 
in 1833 (Narrative, p. 18). Evangelical missionaries were quickly perceived by 
the planters at least to be exceedingly sympathetic to the Abolition of Slavery 
campaign. John Davies wrote in 1812 that the Abolitionists were ‘“‘the friends 
of humanity” and that he loved William Wilberforce “from the heart.” Wray 
complained about treatment of slaves and religious persecution to such 
eminent abolitionist leaders as Wilberforce, Macaulay, and Stephens. He also 
lodged official complaints to the Governor of Berbice about restrictions on 
religious liberty. Thomas Jones of the CMS also wrote to abolitionists such as 
Thomas Fowell Buxton and Henry Brougham in 1826 (CW/051/2). 

The official instructions from the London missionary societies insisted 
that their agents were not to involve themselves in political or secular affairs, 
but the actions of all missionaries was not always in accord with such 
instructions. Therefore the official statements might give an inaccurate 
picture of missionary collusion. For example, W. K. Betts of the CMS had been 
cautioned in 1825 “to use peculiar circumspection and prudence” concerning 
the temporal conditions of slaves (Proceedings, 1825, p. 148). In turn the CMS 
monthly meeting of November, 1828, re-issued instructions warning its 
agents to avoid “secular concerns and political affairs.” All societies during 
the years of slavery gave their missionaries similar instructions and 
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warnings. Nevertheless, the missionaries particularly ignored such warnings 
when it came to the restrictions on reading. Moreover, they taught reading at 
most of the missions. “True Believers,” despite restrictions, are prone to 
overcome obstacles, either by defiance or circumvention, and missionaries 
proved no exception. Though managers and governors themselves might be 
“adverse to reading,” John Smith admitted that they “taught it by Stealth” 
(1823). Wray argued with Governor Murray of Berbice that the Governor’s 
fear of “bad books” was quite unfounded, given that “good books would instill 
good principles.” He rather indiscreetly added that domestic slaves learned 
no more vice from bad books than by “listening to the white population” 
when they served their masters at table (1813). 


Given the pressures against teaching reading, it is really astonishing how 
many missionaries either flagrantly or clandestinely disobeyed instructions 
regarding the matter. Many of these made no efforts at all to disguise the fact. 
Indeed their Missionary Society journals such as The Missionary Herald, The 
Baptist Magazine, and the Evangelical Magazine frequently contained 
extracts from their correspondence which described their pedagogical 
practices, and the teaching of literacy skills was frequently among them. The 
argument might be made that missionaries taught literacy only when the 
political climate in England was against the planters and there was much 
agitation for gradual emancipation. During the heightening political 
campaigns, missionaries gained much public support because of several 
unfortunate incidents of persecution against them. In Demerara a LMS 
missionary had been so harassed and intimidated that following the 1823 
Demerara Slave Revolt he and some Christian slaves from his chapel were 
arrested as chief instigators of the insurrection. Governor Murray of British 
Guiana wrote to Lord Bathurst in 1823 and said, “It is evident that this 
mischief was plotted at the Bethel Chapel . . .” (CO:111-53, p. 10). Smith was 
sentenced to hang but was granted mercy. However, he died in prison and his 
case raised a hue and cry which resounded in abolitionist circles.! In the same 
year a Methodist, Mr. Shrewsbury, was hounded off the island of Barbados 
after a riot which his presence in the colony had purportedly precipitated. His 
Methodist chapel was totally destroyed (An Authentic Report, pp. 17, 59). 
Both were well published events. Nevertheless, the sympathy they roused in 
the twenties does not explain why the missionaries defied prohibitions on 
reading even at the outset of their missions. It only explains why they grew 
increasingly more bold about the matter during the twenties. 


The following are some illustrations which reveal the extent to which 
reading was imperative for missionary education. Wray, of the LMS, in 
Berbice, was teaching reading according to the Lancastrian Plan as early as 
1813 (Extracts). Phillippo, a Baptist in Jamaica, operated his day and Sunday 
schools from 1825 on with a mixture of National and Lancasterian methods 
(Underhill, p. 75). Hurst, an Antiguan Methodist, was requesting a printing 
press for Sunday school lessons in 1814 (Item 58), and Dawes of the CMS, a 
few years later found the National System “objectionable” and his own 
reading methods preferable (1828, pp. 24-27). In 1810, John Davies of the LMS 
was asking for a shipment of Murray Spellers and four years later, for 
copperplates to teach the Langford system of writing (1810, 1814). In 1814, the 
Baptist Missionary Society (BMS) requisitioned 200 copies of Watt's Hymns 
and 300 copies of Rippon’s Selections and scores of tracts for Jamaica (BMS 
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Proceedings, p. 90). One cannot suppose these quantities were ordered not to 
be read. The documents are replete with similar evidences and it remains 
puzzling why some scholars still insist that catechizing was the preferred 
method and that missionaries were deferential to planter authority in all 
things. 

So far it has been demonstrated that the missionaries’ presence in the 
sugar colonies ran counter to the prevailing assumptions of slave society. 
There was dissonance in several areas including the existence of schools for 
slaves and particularly over the teaching of literacy. In addition to 
christianizing and educating slaves, they fundamentally disagreed with the 
rest of the dominant white society on matters of race. Moreover, missionaries 
generally objected to prohibitions on reading and as a rule ignored such 
restrictions which might impede their work. A small body of missionaries 
actually articulated their discontent over slavery and they protested the 
repression of religious freedoms. These evidences put into doubt claims that 
the missionary presence conserved the existing social order. 


Elsa V. Goveia, Slave Society in the British Leeward Islands, (1965) 
argues that missionaries reinforced the existing sanctions to slave society. 
Though the missionaries deliberately stood somewhat apart from the local white 
society . . . they were nevertheless contributing an active and powerful 
reinforcement to the many sanctions already making for the internal stability of 
the slave system. For far from being a threat to the social order, the increase of the 


Christian missions was calculated to maintain and strengthen slave society (pp. 
323-4). 


This may have been the case for the end of the eighteenth century although 
even that is doubtful. It was not the case in the West Indies during the second 
and third decades of the nineteenth century which saw a considerable 
“Increase in Christian missions.” 


Missionary Identification with Dominant 
Power Group 


At this juncture it is appropriate to examine the third problem postulated 
in this paper. To what extent, it asked, did the missionary group identify with 
the dominant white society? It is accurate, of course, to observe that as white 
men, and comparatively educated white men at that, they were logically a 
part of the dominant white class. However, as the preceding quotation tells us 
they “deliberately stood somewhat apart from the local white society.” Their 
alienation was partially self chosen as a type of seige mentality but it was 
also caused by the irritation they caused in others by their rather self- 
congratulatory piety. If missionaries considered that “intimacy” with the 
white population “would not do” (Carter, 1830), neither did the majority of 
planters seek out their company. Thus Smith could lament that he was “as 
much shut out from civilized society” as if he were in the “interior of Africa 
though surrounded by Europeans” (Smith, 1816). Thomas Burchell of the 
BMS observed, “No Englishman would be treated with as much contempt as 
a missionary in the West Indies” (1849, p. 169). Their views of education, 
religion, slavery, morality and miscegenation, were odious to the majority of 
the white population. 


Desegregated chapels were not uncommon in mission stations despite the 
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“offensiveness” of the practice to others (Wray to Burder, 1818). Wray’s 
comment did not pass unnoticed, and neither was it intended to, that “white 
men and coloured women sleep together .. . but will not sit together in God’s 
house” (Wray, 1818). Their emphasis upon continence was only as 
aggravating as their emphasis upon matrimony, given the colonies were not 
partial to either. It was simply intolerable to be publicly told by Wray that 
your society was “a complete brothel” or as Mr. Talboys thundered, that you 
would be “burning in hell” while the coloured population enjoyed “blessings 
in heaven” (Wray, 1816; Talboys, 1816). Neither did missionaries stop at 
telling slaves their master’s duty toward them, with the masters themselves 
present (Knibb, 1825). Similar remarks can be found peppered throughout the 
public record in pages of The Missionary Register, The Anti Slavery Reporter, 
The Christian Remembrances, and The Baptist Annual Register and were 
read by the planting class. The planters did not see missionaries as 
appendages of themselves nor as conservers of social order. In turn, 
missionaries saw the planters as “deists, fornicators, tyrants, and sabbath 
breakers” (Wray, 1818). 

There is evidence which suggests that missionaries did identify with 
conservative social values and even to the extent of inculcating desirable 
social attitudes. Such attitudes were intended to produce an industrious “and 
grateful peasantry.” Thus claims were made by missionaries not only to their 
good intentions but about the laudatory consequences of their systems of 
instruction. William Dawes of the CMS, for example, assured the planting 
class that religious instruction was only intended to make slaves “good 
Christians, and, of course, useful members of society in their humble 
stations” (Item 93). Planters remained unconvinced that slaves would be 
rendered “diligent, faithful, patient, and useful” (Quarterly Chronicle, p. 521). 
Indeed, Wray was accused of teaching the slaves to live “above their stations” 
(CO:318-89). This accusation was gloomily predicated on a general belief in 
England itself about education of the poorer classes. Literacy, indeed any 
form of education, would teach the poor, and certainly slaves too, “to despise 
their stations in life. . . . It would render them fractious and refractory” 
(Hammond & Hammond, 1920, p. 66). 

That John Wray had written a catechism for bondsmen on the subject of 
inculcating sociably acceptable attitudes apparently did not convince the 
planters in the Crown Colony of Berbice either. The catechism included the 
following questions and answers in the section “The Duties of Servants and 
Slaves to Their Masters and Mistresses and Managers.” 

Q. Suppose a servant, or slave, meets with an unfeeling master; does that 
lessen the duty of respect? 

Ans. By no means, for it is the command of God, First Peter 2:18-19, 
“Servants shall be subject to your masters with all fear... 


Q. What is the duty of servants as to the property of the masters? 
Ans. To keep from, and watch against the sins of theft, waste, negligence, 
and to be careful of their master’s property as if it were their own. 


Planters knew such intentions were, in the long run, doomed to failure. The 
Methodist Rev. Hope Masterton Waddell (1863) also knew that ideas about 
producing “happy slaves” were sheer whimsy. A slave might appear docile, or 
grow cunning and calloused, but the urge for freedom was always there. 
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Man is also an improveable animal, and the worse his condition is, he must the 
more desire an improvement of it, to the extent he sees enjoyed by others... . No 
mere improvement of the state of slavery will fully satisfy those subjected to it. (p. 
67) 


Although other missionaries undoubtedly agreed with Waddell’s 
diagnosis of slavery many still insisted that their education was conducive of 
submission and “uniform obedience.” Why then did they so often express 
such sentiments knowing full well, as did Smith, that learning was correctly 
perceived by planters as being in the long run “subversive to good order, the 
slave would be made too wise” (1818). There are three possible reasons for 
either their belief that education to slaves could promote obedience and good- 
will, or for their stubborn reiteration of that belief, although some of them 
may have doubted its probability. 


Firstly, missionaries did not see their role as political. Theirs was an 
expressly moral crusade; they were there to save sinners from the slavery of 
sin and not from the sin of slavery. They may even have genuinely believed it 
was possible to produce good Christian slaves who would accept their 
position in an unjust society by hoping for justice in the after-life. A second 
explanation however, is more likely. Missionaries were “guests” in a hostile 
environment. That is, they were allowed to remain in the colonies at the 
discretion of legislatures which were controlled by planters. They had 
generally come to the Caribbean uninvited and, as the cases of the Rev. 
Shrewsbury and John Smith demonstrate, they could be forced out or 
wretchedly persecuted by choosing to remain. Some missionaries actually 
resided on sugar estates because a few planters requested their services. 
Although the majority of the white class treated missionaries with contempt, 
it was not always so as the “Declaration by Managers, etc., or East Coast 
Plantation of Beterhoop, Demerara,” in 1808 indicates. This declaration 
approved of missionary activities on the sugar plantations as long as it could 
be determined that no reading was occurring and that the cardinal virtues 
were being inculcated. Besides, a few planters were astute enough to realize 
that having a missionary work for them made for dependence and that 
missionary’s activities could be supervised more closely. Missionaries did not 
prefer this situation for it often called for an uncomfortable compromise of 
their principles. On occasion, as with the CMS in Jamaica, some attorneys 
and managers of estates actually sat on the society’s local auxiliary 
committee (Stearne, 1830). They were in the position of reporting back to the 
Parent Society in London if the missionary concerned who worked on their 
estates was not cooperating. In such circumstances some missionaries found 
circumvention of restrictions impossible and aloofness from the white 
population impractical to maintain. 


In the final analysis, the Colonial Legislatures and Assemblies had the 
power to revoke licences to preach and teach or refuse to issue such licences. 
Neither did they stop at wielding this power. Thus we can see the pragmatic 
reasons behind some missionary capitulation and understand the instinct to 
treat warily rather than offend those who could put a whole mission in 
jeopardy. One could not depend on governors’ support. They were constrained 
by their situation, even if against slavery. One governor during the trial of 
Missionary Smith was himself a slave owner and planter (CO:111-46). Smith 
referred to him as a man “who sets his pace against the moral and religious 
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improvement of the slaves, but he himself is a party concerned and no doubt 
solicitous to perpetuate the present cruel system” (CO:111-42). 


Thirdly, those missonaries who insisted that theirs was an education 
which encouraged appropriate Christian values such as gratitude and 
obedience, may have been keeping the safety of their slave converts in mind. 
The Demerara Slave Revolt had been a warning that Christian slaves would 
be the first to be accused of inciting riot in times of political backlash. 
Quamina, the head Deacon of Bethel Chapel, was tried for being the chief 
insurgent. Quamina had been described by John Smith as “‘the most sensible 
negro belonging to Success (plantation)” (CO:111-46). The Slave Rebellion in 
Jamaica in 1831 was significantly called “The Baptist War’ and many 
Baptist and Christian slaves were brutally rounded up after the insurrection 
and executed. Therefore, given that few members of any given group of men 
are willing heroes or martyrs, it is not surprising that some missionary 
sentiment appeared to be capitulating and compromising. Sometimes 
expressions of capitulation proved embarrassing to other missionaries. The 
best known scandal caused by capitulation, and often used as evidence that 
missionaries as a group identified with the white population and supported 
the social order, was the case of the Jamaican WMMS. The Wesleyan 
Committee on that island published a series of resolutions in the Jamaica 
Royal Gazette in 1824. These resolutions included statements which assured 
the planting class that missionaries did not see slavery as incompatible with 
Christianity and that missionaries did not approve of Abolitionist sym- 
pathies. Such resolutions were disavowed by the Wesleyan Conference in 
Britain and the Parent Society. That the Wesleyans involved were severely 
chastised and made to withdraw their statements suggests this case in no 
way typified the general missionary feelings about slavery. Indeed their 
correspondence reveals uneasy consciences and much private agonizing on 
the matter. 


Moreover, missionaries from the four evangelical societies did not 
themselves participate in slave or plantation ownership. Unlike the 
Moravian missionaries who cooperated harmoniously with slave society, the 
four missionary societies, representing Wesleyan Methodists, Baptists, 
Independents, and the evangelical party of the Anglican Church, saw 
participation in either forms of property as “impolitic.” Furthermore, it was 
felt that involvement in the plantation system was “injurious to the cause of 
evangelical religion” (Items 173, 175; Minutes, p. 402; BMS Book 7, p. 136). In 
a society where being “white” automatically guaranteed political and social 
superiority, the rejection of such fundamental economic values of the 
dominant white class set missionaries apart. It may be concluded, then, that 
missionaries did not identify with the dominant power group on matters of 
assumption, practices, or private conduct. ‘Their verbal identifications were 
on matters of cardinal virtues, which it must in all honesty be said, were as 
applicable to white society as to slave society. Slave or free, white or black, 
rich or poor — they were equally required to demonstrate those Christian 
precepts of gratitude and humility. For those less militant missionaries, 
inactivity was probably as much evidence of pragmatism as it was of 
identification with slave owners or slavery. It cannot be seen as tacit 
approval of a system which after all suppressed their reason for being in the 
West Indies. This reason was to teach slaves and convert the “poor heathen.” 
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Planter Response to Missionary Education 

The fourth question this paper addresses itself to is, perhaps, the easiest to 
answer. Indeed it has already been partially covered in the preceding three 
arguments. What was the nature of planter response to missionary 
education? Let us make no mistake about it, the planters muttered about “the 
missionary business here” (Van Cooten, 1815). And they did more than 
mutter. Given the circumvention of laws and prohibitions, of which planters 
were fully cognizant, they saw these intruders as interfering prigs and 
canting dissemblers not above duplicity themselves. The West Indian press 
was virulent against missionary activity which they perceived as a threat to 
social order. Certainly the education of slaves was not generally seen as in the 
best interests of slave society. The Colonist in 1824 printed a vituperative 
article which claimed that missionary education would lead to radicalism. 
Slaves must remain as they were, that is, ignorant, because it was firmly 
believed that “SLAVERY MUST EXIST AS IT IS NOW OR IT WILL NOT 
EXIST AT ALL.” The assumptions of slavery could not accommodate the 
assumptions behind missionary education. In 1817 John Wray was called “‘a 
hired spy” in The Guiana Chronicle and six years later The Colonist was 
continuing the tradition of accusing missionaries of subversive activities. The 
planters sporadically attempted to forbid preaching, restrain religious 
freedoms, harass missionaries, disturb chapel services, destroy chapels and 
intercept missionary mail. 


Missionaries felt sufficiently intimidated to take precautions lest the 
content of their lessons and sermons be misinterpreted. John Smith in 1817 
omitted a passage from a sermon which contained references to God 
“promising deliverance [of Israelite slaves] from the land of Canaan” 
(CO:111-42). He was conscious that his intentions might be misconstrued by 
planters or that negroes might see the analogy and become restless. Planters 
tampered with missionary teaching methods, interfered with the conduct of 
their classrooms, and censored content to be transmitted. The Guiana 
Chronicle in 1824 objected to certain hymns whose verses it felt, could not be 
dismissed as mere theological metaphor. 

Oh when Africa’s sable sons 
Enjoy the heavenly word, 


And vassals long enslaved become, 
The freedom of the Lord. 


In 1832, Lord Mulgrave of Jamaica recorded his impressions of missionary 
presence in the West Indies. Clearly missionaries were seen by planters as 
usurpers of social order. Mulgrave observed that “the greater part” of the 
population was “exasperated to the last degree against the Baptist 
missionaries, whom they firmly believed to have been at the bottom of the 
rebellion ...” (Jones, 1963). Of course it would be absurd to maintain that the 
missionaries actually encouraged civil disorder and violent rebellion, but 
their presence and their teachings were attributed to some power for being 
able to do so. Even those whites who were not “violent” against missionary 
education feared that it must “ultimately have a bad effect” (Smith, 1817; 
Stephens 1824, p. 451). 


William Green’s claim in British Slave Emancipation (1976) must be 
modified. Missionary education was more than just ‘‘a means by which the 
established classes could tame the multitudes, repress social barbarism and 
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preserve their own superior status” (p. 327). Some planters saw such uses 
during slavery but the majority did not. It was only after slavery officially 
ended in 1833 that planters actively sought missionary cooperation in 
training the ex-slaves to industry and subservience. They then saw the 
advantages of a religiously conservative and class-oriented missionary 
education, but that was only after much of their former power had been 
wrested from them. Slavery itself had provided the perfect means for “taming 
the multitudes.” During that period their own “superior status” had been 
preserved by threat and coercion. 


Conclusion 


Missionary education during slavery transmitted traditional moral values 
and conservative religious precepts. Nevertheless, the effects of their 
education cannot be seen in such simple terms as crude social control or as a 
reinforcement of slave society. Their main goal was to convert sinners but in 
doing this they provided a rudimentary education for part of the slave 
population which may otherwise have had none. Because consequences are 
historically more significant than intentions, the number of literates 
produced is more important than the number of converts. This has been 
similarly argued by Thomas Laquer in Religion and Respectability (1976) 
which provides a challenging reinterpretation of the Sunday School move- 
ment. 

Further to this, missionary education was for the most part the only 
means by which slaves and freedmen could enter the white world. We may 
somewhat gratuitously see this entry as capitulation or even as white 
imposition but this would indeed be a hasty judgment. Eugene Genovese has 
argued for the psycho-social significance of “black” religion in his book Roll 
Jordan Roll (1974). The West Indian situation might be similarly explained in 
light of the leadership which came out of the missions during and after 
slavery. If missionary education offered nothing but “‘piein the sky,” it seems 
curious that the planting class was almost uniformly adverse to it. The claim 
has been made by various scholars such as E. P. Thompson (1968), Hammond 
and Hammond (1920), G. Kitson Clark (1967), and Elie Halévy (1961), that the 
Methodist Movement in the British working classes helped produce 
organizational structures, cultural cohesion, future secular leadership (as 
distinct from radical leadership), group identify, aesthetic and psychological 
expression, and written and verbal skills. Much of this, including Genovese’s 
work, has been written from a Marxist perspective. I contend that such 
dimension gives us additional insight into the planters’ hostility toward 
missionary education of slaves. They feared that the lessons of the chapel and 
classroom would create a group of selfconscious subservients. They were 
convinced that these pious pedagogues were “public spies and private 
traducers, troublesome and ungrateful sojourners whose doctrines and 
conduct tended to excite insubordination and rebellion . . .” (Smith, 1817). 

Missionary assumptions and practices were bound to disturb the ordered 
conventions of slave society where masters kept slaves in their place and 
slaves knew their place. The white population’s stereotypes of happy 
“Sambos” and poor “Quashees” were based on a desire to deny them any 
hope for a change in status. Intelligent and astute adult bondsmen responded 
enthusiastically to the only education accessible to them. By all accounts 
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christianized slaves enjoyed those opportunities offered to them through their 
participation in chapel. It must have been significant for slaves to be 
associated on a voluntary basis with a school or chapel. After all, slaves had 
so few opportunities in the ordering of their own lives.? Finally, if 
missionaries “contaminated” the values of Caribbean slave subculture by 
introducing alien European values, let us remember, this was only after they 
had already been contaminated. The dehumanizing processes of perpetual 
bondage had already broken down their own social structures in the 
imposition of the worst kinds of European values. In brief, the planters 
realized that merely by their presence the missionaries were influencing the 
society to be other than it was. 


Notes 


1 Smith was arrested after the 1823 Slave Revolt and died in prison. His case was much publicized in the British 
Abolitionist Press. The Colonist, October 27, 1823, January 1, 1824, and October 13, 1823; The Guiana Chronicle 
and Demerara Gazette, 1824 and March 1, 1824; John Bull, January 26, 1824, p. 36; The Norwich Mercury, June 
14, 1824; The New Times, April 17, 1824 and The Morning Chronicle, June 3, 1824. 


2 Slave response to missionary education has been speculated upon in greater detail in a paper presented to the 1977 
meeting of the Canadian Association of Foundations in Education, Fredericton, 1977, called “The World They 
Made: The Politics of Missionary Education to British West Indian Slaves (1800-33).” 
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Priorities of Teacher Behaviours as Perceived 
by Student-Teachers in a School Based Program: 
An Exploratory Study 


The teacher’s role is comprised of a variety of expectations. The priorities 
assigned these expectations by an instructor provide one means of describing 
his teaching. It may also permit categorizing him as belonging to a group of 
persons who have similar priorities. This paper describes a group of student 
teachers and subgroups within the larger group on the basis of priorities 
assigned to expectations arrayed in a 60-item Q-sort. The total group of 
subjects, all of whom had experienced a year in a school-based alternative 
teacher education program, attached highest priority to items reflecting direct 
and positive interaction with pupils. On the basis of lower level priorities, three 
subgroups or types of student teachers within the larger group were 
distinguished. Type A is defined as giving low priority to ministerial and 
school directives, Type B is most concerned with classroom control and 
personal confidence, and Type C emphasizes the importance of social 
interaction among pupils. (Dr. Housego is in the Department of Educational 
Psychology and Dr. Boldt is a member of the Science Education Department, at 
the University of British Columbia.) 


At the University of British Columbia, there are two standard degree 
programs that lead to the degree of B.Ed. (Elementary). One of these, the 
“regular program,” is for students who enter the Faculty of Education after 
completing high school, while the other, the “transfer” program, is for 
students who transfer to the Faculty of Education after completing two years 
of Arts and/or Science courses elsewhere. Each of these programs entails in 
total a four-year sequence of coursework consisting of both professional 
(Education) courses and academic (Arts and/or Science) courses. In the 
transfer program, the first two years contain only academic coursework, 
usually completed at a community college. In the regular program, although 
the first two years contain mainly academic coursework, the students 
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complete an Education course in the second year, and in addition undertake 
student teaching. For both programs, the third and fourth years are very 
similar, consisting almost entirely of professional coursework and student 
teaching, theory alternating with practice. The stress throughout is on 
general teaching competence, but students are allowed a limited opportunity 
to begin to concentrate in special areas of interest, both academic and 
professional. 


The Faculty of Education has initiated programs of teacher education that 
are different from, and alternative to, the standard programs. One of these is 
known as COVET (Cooperative Venture in the Education of Teachers). In its 
first three years of operation, it has been an alternative to the third year of the 
standard transfer program. In its fourth year, it will be a parallel alternative 
in the standard regular program. Essentially, the COVET alternative 
replaces the professional year with a program that is entirely school-based, a 
group of about 25 students receiving their coursework in one elementary 
school from a team of university personnel, and engaging in student-teaching 
in the same school throughout most of the year. 


Program Rationale 


One of the most persistent problems in teacher education is that of 
ensuring that theory and practice are successfully melded. The COVET 
program, which was designed to attempt a solution to this problem, is based 
on the following assumptions, stated in the current COVET Handbook 
(Bates, 1976): 

1. there are some components of a teacher education program that the public schools 
and public school teachers are best able to provide; 

2. there are some components of a teacher education program that the university and 
university teachers are best able to provide; 

3. the likelihood of a more successful marriage between theory and practice will be 
increased when the school components are monitored by university personnel and 
when university components are monitored by school personnel; 

4, effective monitoring, as described under 3, is more easily assured when the two sets 
of components occur in the same physical setting. 


The initiators of this school-based teacher education program believe that 
participating teachers should see and know the university’s contribution and 
should be allowed to influence and modify it to some degree, and similarly, 
that the participating teachers from the Faculty of Education should see and 
know the school’s contribution and should be allowed to influence and modify 
it to some degree. 


The public school has the resources and facilities to provide the practical 
component in a teacher education program. In the COVET program, students 
are deliberately exposed to a variety of classroom conditions and teaching 
styles; they experience working with children outside the classroom as well as 
inside, and they continually assess their own personal attributes and 
teaching aptitude from one situation to another. They are able to observe and 
be a part of many interrelationships that occur within the school, 
relationships among school personnel, and between school personnel and 
other agencies such as district services, parents, and the wider community. 


The university is equipped to provide theoretical foundations for 
education, to summarize and interpret research and writing which have 
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implications for the improvement of teaching, and to provide a broader 
perspective than is customarily encountered in a single school. Therefore, the 
COVET program is based in a public school where student teachers receive 
theoretical instruction from university teachers working in the school, where 
they attempt to apply theory under realistic conditions with the guidance of 
the school’s and the university’s teaching staffs, and where daily interaction 
among three groups, the student teachers, the regular school teachers and 
university teachers occurs (Bates, 1976). 


Program Operation 


The development and implementation of alternative teacher education 
programs at U.B.C. in 1974 was based on several considerations: the belief 
that no one program to educate teachers had been demonstrated to be 
superior to all others; the expectation that smaller scale programs would 
probably be more personal than larger ones; the expectation that students or 
professors choosing their preferred program or style of teaching and learning 
would make greater efforts to ensure the success of the particular program, 
and, finally, the hope that alternative programs might serve the purpose of 
generating improvements in the standard programs of the faculty. In 
response to an invitation to all faculty to submit proposals for alternative 
programs, the COVET proposal was submitted to the special committee set up 
to receive such proposals. By the fall of 1974 the program was operational in a 
Vancouver elementary school with 24 third-year transfer students enrolled. 


Aside from their lectures, the experiences of COVET students were of four 
types: classroom-laboratory, clinical, internship, and research and develop- 
ment. Classroom-laboratory experiences involved tasks selected to illuminate 
certain educational ideas such as various approaches to grouping for 
instruction or particular teaching strategies. Clinical experience provided 
’ real cases for observation in that a student might, for example, observe his 
sponsor-teacher working with an emotionally disturbed child or attempting to 
solve a disciplinary problem. Internship experiences were largely ones in 
which students engaged in teaching under realistic conditions with 
decreasing amounts of supervision. Research and development experiences 
were ones in which students attempted to relate research techniques and 
findings to school problems and conditions, perhaps carrying out their own 
minor investigations. 

Students worked with a number of different sponsor teachers during the 
COVET year, which began with the commencement of school in September 
and ended with the university term the following April. Exposure to a variety 
of grade-levels and personal teaching styles was thereby assured and further 
augmented by a period of experience in another school of the student’s own 
choice, a school known to have some features not present in the COVET 
school, for example, family grouping or perhaps open-area teaching. 

The COVET program was assigned its own classroom in the school, this 
space serving as a location for university instruction in the teaching methods 
of the various subject-matter areas of the elementary school curriculum, in 
child growth and development, in measurement and evaluation and in 
educational foundations, in total, the equivalent of six full-year courses. The 
assigned classroom also served as a common room, a study area, and a 
library. 
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For two additional years, 1975-1977, the program has continued to operate 
in a similar fashion in a Richmond elementary school. In consultation with 
the school staff, some changes have been made in the number, length, and 
spacing of ‘teaching blocks,” a teaching block being a period of time in which 
students are engaged in classroom activity for at least four half-days weekly, 
and during which time university instruction is accordingly reduced. Two 
three-week “instruction blocks,” periods of time devoted exclusively to 
university instruction, one in September following a week of orientation 
activities, and one in January, have been standard. The number and variety 
of guest speakers, special seminar events, and opportunities to participate in 
school-wide functions (conventions, non-instructional day programs, etc.) 
have varied from year to year. Course offerings have changed somewhat 
either because of course changes in the regular program or in accordance with 
the availability of personnel. An outdoor education experience has become a 
regular feature. Despite modifications in content and basic organization, the 
program still bears a fairly close resemblance to the original. 


Innovations are generally called upon to prove themselves, the COVET 
program being no exception. It was evaluated by the Northwest Regional 
Laboratory as part of an evaluation of all alternative programs in 1975, 
midway through their first year of operation (Worthen, Owens & Anderson, 
1975). Internal, formative program evaluation instruments and procedures 
were also designed and used, and these generated some program changes; 
however, after three years of operation, a closer look at the “student product” 
of the program seemed appropriate. Having previously assessed student- 
teacher and sponsor-teacher reactions to course offerings, particular 
experiences, methods of scheduling, program administration, and the like, it 
seemed timely to shift focus to the psychological domain of the persons in the 
program. Could their reactions to a specific set of “critical behaviours” that 
occur in teaching be characterized in a particular way, and if so, could 
perceptions and changes in perception concerning these behaviours be used 
as a point of departure for program evaluation and program development? 


Problem 


One of the acknowledged strengths of school-based teacher education 
programs, a strength identified by student-teachers, by school personnel and 
by university personnel is their potential to provide a practical, realistic, and 
meaningful preparation for beginning teachers. Such programs afford the 
student the opportunity to become an integral part of a school in operation 
and to develop priorities among a set of behaviours which characterize the 
teacher’s role, as perceived by experienced teachers and educational theorists. 
In order to pursue questions of priority with regard to components of 
teaching, it becomes necessary then, to identify behaviours which experienc- 
ed practitioners and/or theorists regard as essential to defining the role of the 
professional educator. 


According to Broudy (1958), education has distinctive problems of practice 
and resource of knowledge that can be used to deal with these problems. 
Problems of practice fall roughly into four categories: aims and policy, 
organization-administration, curriculum, and teaching-learning. Critical 
teacher behaviours in the aims and policy category have to do with 
formulating and implementing educational aims and objectives that are 
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consistent with educational policies established on some rational basis. 
Behaviours in the organization-administration category are all those that 
pertain to managing and organizing students, teachers, staff, resources, 
facilities, support services, and so forth, in order to facilitate the educational 
process in an appropriate and acceptable way. The curriculum category 
includes teacher behaviours having to do with designing, implementing, and 
evaluating curriculum specifications and materials. Lastly, behaviours in the 
teaching-learning category have to do with developing, implementing and 
evaluating teaching-learning strategies. Behaviours in this final category 
include lesson planning, teaching (directly or indirectly), and evaluating 
learning outcomes. Although the categories are not mutually exclusive, they 
are useful in mapping in a general way the activities of a teacher in the role of 
a professional educator. 


Since schools operate in a number of different contexts of which the major 
ones are philosophical, psychological, historical and societal, problems of 
practice in the four categories described earlier will usually occur in one or 
more of these contexts. Considering these contexts as dimensions, one can 
say that critical teacher behaviours often fall on one or more of these 
dimensions (philosophical, psychological, historical, and societal) each of 
which spans the previously described four categories of critical teacher 
behaviours. To illustrate, an item, “controlling the class,” can be classified as 
belonging to the organization-administration category and also as falling on 


Categories of Teacher Behaviour 


Aims and Organization- Teaching— 
Policy Administration Curriculum Learning 


Controlling 
the Class 


Historical Psychological Philosophical 


Contexts in Which Schools Operate 


Societal 


Figure 1. A two-dimensional categorization chart for components of teaching. 
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the philosephical, psychological, historical and societal diinensions (see 
Figure 1). Controlling the class from a particular philosophical perspective, 
for example that of the realist or the pragmatist, illustrates a philosophical 
component, whereas controlling the class according to the principles of 
behaviour m-ecz.fication illustrates a psychological component. Many 
teachers control classes simply according to tradition, perhaps through 
detentions or other conventional forms of punishment, thereby illustrating an 
historical component of the behaviour. Finally, parents, government, and 
society at large often have considerable influence on how children are 
controlled, thus illustrating the societal component of the particular 
behaviour. 


Given a set of possible critical behaviours identified on the basis of the 
foregoing rationale, the problem is to assess the priorities assigned these 
behaviours by (a) students currently in the COVET program (COVET III), 
and (b) students in the COVET program during 1975-76 who are now 
completing their B.Ed. degrees (COVET II). The study also attempts to locate 
common patterns of priorities which emerge either in the total group of 
subjects or in subgroups thereof. These patterns characterize hypothetical 
types of subjects identified in later sections. © 


Method 
Instrument 


A 60-item Q-sort was used to measure perception of priorities among 
teacher behaviours, the independent variable of this study. The items in the 
Q-sort consist of concise statements of teacher behaviours generated 
according to the previously described rationale. 


An original pool of over 100 critical teacher behaviours was constructed 
through an on-the-job and theoretical analysis involving discussions with 
principals, teachers, and student teachers, and a study of the literature 
(Ryans, 1960; Sontag, 1968; Broudy, 1958). Four faculty members involved in 
the COVET program for three years screened and edited the items to ensure 
validity for the purpose intended, i.e., to determine what priority, according to 
COVET students, a beginning teacher ought to assign to each of a range of 
critical teacher behaviours. The validity check reduced the pool of 100 items to 
the 60 items used in the final Q-sort. The items are listed in abbreviated form 
in Table 3. 


Subjects 


In the first phase of the study, presented in this paper, a group of 39 
student teachers was studied. Twenty of the student teachers had been in the 
COVET program in 1975-76 and were completing a subsequent year of studies 
at the university, primarily in the areas of their academic and professional 
concentrations. The remaining 19 were just completing the 1976-77 COVET 
program, having transferred directly from two years of academic work in 
regional colleges to the COVET program. 


Administration of the Instrument and Scoring 


The 60-item Q-sort was administered to each of the 39 subjects. The 
subjects were asked to sort the behaviours into a quasi-normal distribution 
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TABLE 1 
DESIGN AND SCORING SCHEME FOR THE Q-SORT 


Evaluative Highest Medium Lowest 
Criterion Priority Priority Priority 
Category fal 10 9 8 u 6 5 4 3 2 
Frequency 2 3 4 6 9 12 9 6 4 3 2 
Q-Score 11 10 S) 8 i 6 3) 4 3 z iL 


M = 60 items 


according to the priority each behaviour ought to be given by a beginning 
teacher. The design of the Q-sort and the scoring scheme is given in Table 1. 


Test-Retest Reliability of the Q-Sort 


In order to assess the stability of the Q-sorts over time, ten students, 
selected on the basis of convenience and availability from the COVET III 
group, repeated the initial sorting task after an intervening time period of one 
month. To what extent they constitute a random sample of all 39 subjects is 
not known, although no obvious student-characteristics could be seen as a 
source of systematic error. 


The average test-retest coefficient of association was 0.67, which indicates 
that the sortings were reasonably stable over time, at least over a period of 
time in which the students were not in training. Along with this, it was noted 
by the authors that the highest priority items and the lowest priority items 
remained the same in the retest. 


Statistical Analysis 


The individual profiles of Q-scores of all 39 subjects were analyzed as 
follows: 

1. Every subject’s profile of Q-scores was correlated with every other subject’s pattern 
of scores in order to determine common patterns of priorities. 

2. The intercorrelations between individual profiles were analyzed to determine the 
least number cf common patterns that could adequately represent the priorities of 
the subjects. The resulting common patterns of priorities were taken to characterize 
hypothetical types of subjects existing within the whole group of COVET students. 
Whether or not these types represented COVET II and COVET III students 
respectively, and therefore distinguished one group from the other was of 
particular interest. 

3. In the final step of the analysis, an attempt was made to describe and compare the 
different types of subjects in terms of priorities assigned to the critical behaviours. 


Results and Interpretation 
Correlations Between Profiles 


The correlations between Q-score profiles of the 39 subjects ranged from 
0.01 to 0.75. Examination of the correlation coefficients suggests considerable 
consensus on the priorities assigned the items. 
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Identification of Subjects According to Type 

The subjects were grouped or categorized on the basis of the type with 
which they were most highly correlated. Subjects 1, 4, 8, 14, 18, 27, and 29 
belong to mixed types. Subject 38 is only weakly associated with any of the 
three types identified. Table 2 shows type membership by subject number, the 
number of subjects of each type and the COVET group to which each belongs. 


TABLE 2 


SUBJECTS BELONGING TO THE THREE DIFFERENT TYPES IDENTIFIED 
BY NUMBER AND COVET GROUP 


Types 2 
A B (¢ 
6 3| al 
8 4 2 
9 5 18 
10 7 Lis 
aL: 14 23 
V2 15 24 
13 ai 5s 25 
17 30 31 
i] 32 38 
ce 
22. 
26 
oa | 
28 
29 
33 
34 
35 
36 
37 
39 
Total Zi 9 9 


- Subjects above the dotted line are in the COVET III group and those 
below the line are in COVET II. 


Comparison and Description of the Types 


Items on which there is consensus across types. A consensus item is one on 
which, for all types, the difference between the largest z-score and the 
smallest z-score given that item is less than 1.00. Table 3 gives the items on 
which the three Types of students agree in order of priority, from highest to 
lowest. The rank-ordering of the items in terms of priority was based on the 
size of the average z-score of each of the consensus items across all three 
types. 

Of note is the large number of items (70%) on which all the types seem to 
agree. From examination of the priorities given to these items, it seems clear 
that teacher behaviours falling on the psychological dimension of the 
teaching-learning category are top priority items for the beginning teacher, 
while behaviours falling on the societal dimension (compliance with 
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TABLE 3 


CONSENSUS ITEMS IN ORDER OF PRIORITY FROM HIGHEST 
TO LOWEST PRIORITY 


Item Average Z-score 


9. Stimulating interest in learning 
10. Interacting positively with pupils 
25. Being enthusiastic 
31. Setting appropriate standards of achievement 
7. Planning with pupils 
40. Providing constructive feedback 
4. Choosing appropriate level of subject matter 
5. Choosing appropriate level of skills 
12. Giving clear directions 
16. Assisting pupils with subject matter 
15. Encouraging good oral communication 
6. Planning prior to teaching 
1. Using variety of teaching methods 
2. Knowing subject matter 
30. Displaying a sense of humour 
39. Keeping records of achievement 
13. Questioning effectively 
35. Disciplining pupils 
3. Being competent in skills 
59. Relating well to parents 
34. Ensuring safety of pupils 
28. Speaking clearly 
37. Marking fairly 
11. Using teaching aids 
26. Using appropriate language 
17. Giving suitable assignments 
55. Participating in extra curricular activities 
57. Keeping a day-book 
23. Appearing neat and clean 
44. Conforming to policies of the school 
52. Respecting school property 
47. Using school equipment in a responsible way 
51. Using school supplies in a responsible way 
19. Giving suitable homework 
58. Keeping attendance records 
38. Marking promptly 
29. Modulating the voice 
18. Summarizing the lesson 
49. Giving adequate notice about changes 
32. Teaching groups of various sizes 
24. Dressing appropriately 
33. Supervising outside the classroom 
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expectations of society and school authorities) of the organizational category 
are of lower priority. 

Items on which there is disagreement across types. Figure 2 shows the 
items which differentiate among the three types. The differences between one 
type and the other two depicted in Figure 2 are summarized and discussed 
below: 

More than Types B and C, Type A puts high priority on: 

21. Being patient 


More than Types B and C, Type A puts low priority on: 
20. Keeping classroom neat and attractive 
41. Teaching approved curriculum 
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45. Conforming to requests of Ministry of Education 
46. Conforming to guidelines set by principal 
48. Responding promptly to requests by the principal 
56. Reporting to parents 

More than Types A and C, Type B puts high priority on: 
20. Keeping classroom neat and attractive 
27. Appearing confident 
36. Controlling the class 

More than Types A and C, Type B puts low priority on: 
14. Assisting pupils in social interaction 
42. Evaluating new curriculum materials 
50. Making reasonable demands for materials and facilities 
55. Participating in extra curricular activities 
60. Utilizing community resources 

More than Types A and B, Type C puts high priority on: 
14. Assisting pupils in social interactions 
48. Responding promptly to requests from the principal 
56. Reporting to parents 

More than Types A and B, Type C puts low priority on: 

8. Timing parts of the lesson 

22. Writing or printing legibly 
27. Appearing confident 
33. Organizing pupils for instruction 
43. Developing new curriculum materials 


A closer examination of Type A suggests that behaviours given lower 
priority than that given by the other types are those that fall along the 
societal (compliance with authority) dimension of several categories, 
curriculum, organization-administration, and aims-policy. Integrating these 
results suggests a perhaps somewhat anti-establishment group of students. 
Table 2 shows that the two COVET groups have similar numbers of students 
who share this view. 


Type B emphasizes the priority of controlling the class, housekeeping, and 
appearing confident. These items can be as seen as falling along the 
psychological dimension of two categories, organization and, possibly, 
teacher-learning. Lower priority was given to items falling on the societal 
dimension (dealing with people and engaging in out-of-class activities 
involving the concerns and interests of students and society at large) of two 
categories, curriculum and organization. Taking these results together 
suggests a possibly insecure group of students who feel they have more than 
enough to cope with in managing the classroom. Of special note here is that 
Type B is predominantly made up of students in the COVET III program (see 
Table 2). 

Type C, unlike the others, stresses the social interaction of students and 
meeting the expectations of administrators and parents, all societal concerns. 
These items seem to fall along the psychological dimension of the teaching- 
learning category and aspects of the historical and societal dimensions of 
organization and administration. Lower priority was given to other items in 
the organization and curriculum categories. The dimensions of these 
categories on which these items seem to fall are not clear, but most appear to 
have to do with the “mechanics” of teaching. Combining these results 
suggests a group of students who emphasize the humanistic aspects of 
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teaching. There seems to be no distinction between the two COVET groups on 
this priority. 

The results of this study highlight considerable consensus. Overall, the 
highest priority is assigned to behaviours relating to direct and positive 
interaction with students, and the lowest priority is given to organizational 
and administrative behaviours. Another outcome of the study is the 
identification of at least three types of subjects, distinguished on the basis of 
the relative priority given behaviours having to do with personal concern, 
concern for controlling the class, and concern for the social and public aspects 
of teaching. 


Only one clear distinction between the two already established subgroups, 
COVET II and COVET III, can be made. Type B, which put relatively high 
priority on personal concerns, housekeeping and controlling the class, was 
primarily represented by COVET III subjects, which may suggest that the 
more directly and the more recently the students have been involved in 
classroom teaching, the greater their concern for survival type behaviours. 


It appears that the instrument developed in this study does discriminate 
between students recently involved in classroom teaching and those who are 
presently removed from it, pursuing academic and professional course work. 
Perhaps more importantly, it may have some practical utility in program 
development and evaluation. 
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BOOK REVIEW 


PSYCHOLOGY FOR THE CLASSROOM. By Janice Gibson. Scarborough, 
Ontario: Prentice-Hall of Canada Ltd., 1977, $12.65, 563 pp. 


Psychology for the Classroom is an academic textbook designed for 
university use. It would most likely be of use to instructors in the field of 
educational psychology, classroom management, or the psychology of 
instruction. 


The text is rather comprehensive and does justice to most of the material it 
covers. Major sections include: (1) The Developing Individual — childhood 
years and the adolescent years. This section will not particularly enthrall 
developmental psychologists because topical issues related to education are 
given more coverage than developmental issues and principles. However, it is 
true to the book title in this regard. (2) Principles of Learning and Teaching — 
which overviews associationist learning theories, motivation in the 
classroom, and other traditional materials such as learning, forgetting, and 
transferring. (3) Individual Differences — which deals with “‘social disadvan- 
taged” students with special needs, and the role of the teacher in coping with 
these matters in the classroom. This is an ambitious section done within the 
limits of a discipline which has a narrow, essentially technical, understan- 
ding of these social-philosophical issues. (4) Methods of Evaluation — which 
deals with testing, types of tests, and their classroom value. I found the 
section on the “developing individual” to be the weakest, with the evaluation 
section being the strongest. The writing style is consistent throughout, with 
emphasis on readability without resorting to the popular tendency among the 
writers of undergraduate textbooks to be cutesy or humorous. 


The book is best suited to those professionals who prefer the “traditional” 
approach to educational psychology, who emphasize testing and evaluation, 
and who emphasize the variety of teaching-learning techniques which owe 
their origins to psychological learning theory. Little “soft” language 
punctuates the book but, which to me is a relief, there seems to be a conscious 
effort to make learning theory less stuffy and monotonous than some writers 
pretend it is. Each chapter and most subsections benefit nicely from 
summaries. Examples are used judiciously and with good effect. All in all, the 
book is well written, rather substantial, comprehensive, and academic. And, 
for a book which covers everything from Maslow’s need hierarchy to operant 
conditioning, that is a feat unto itself. My only real grievance with the book is 
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that the chapter on adolescent psychology is stereotypic and fails to assess 
pioneer ideas in the field, falling back instead on the guiding philosophy of 
the book: traditional educational psychology. The back to basics advocates 
will enjoy the book. 


J.J. Mitchell, Professor 
Department of Educational Psychology 
The University of Alberta 


The Same but Different 
Understanding the Preschool 
Retarded Child 

by John B. Fotheringham, M.D. 

& Joan Morris, M.Ed. 


This is a book of interest to anyone who 
wishes to understanda retarded child. The 
authors mention in their preface that their 
intended audience includes ‘‘professional 
teachers and counsellors who, in their 
concentration on the normal or average, 
have had to overlook the ‘different’ one 
who will need their extra patience, 
teaching and guidance.”’ (from the 
preface) Dr. John Fotheringham, director 
of Child Development and Counselling 
Services at North York General Hospital 
and Joan Morris, head of Guidance 
Services at Eastview Secondary School in 
Barrie, reason and write about a most 
complex subject with a clarity of style that 
will engage the attention of all readers. 
They point out that retarded children have 
the same basic needs and rights as all 
other people but that their retardation 
influences the expression of these needs. 


108 pages/paper, $3.85/cloth, $6.00 


Counselling the 


intellectually Gifted Child 
by Avner Ziv 


This “jargon-free handbook’”’ is ‘‘an 
excellent introductory overview for 
anyone — teachers, psychologists, 
parents, and the children themselves — 
interested in identifying and considering 
programs for the intellectually gifted.” Dr. 
Ziv, director of the counselling program at 
Tel Aviv University, cautions readers that 
“the intellectually gifted are still children 
with emotional and physical abilities in 
keeping with their chronological ages, 
and that they must be considered as 
children rather than I.Q. scores.” 

from Quill & Quire 


124 pages/paper, $4.95/cloth, $11.50 
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Guidance Centre 
Faculty of Education 
University of Toronto 
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HUGH MUNBY 


and 


ROBERT WILSON 


Queen’s University 


Convergent and Discriminant Validity of 
Classroom Observation Instruments: Conceptual 
Background, Critique, and a Case in Point 


An epistemological interpretation of the Campbell and Fiske model is offered 
to show its usefulness in validating classroom observation instruments. This 
interpretation is used to critique a recent validation study, and is then applied 
to the problem of validating the construct “View of Science,” using a classroom 
observation instrument. 

The validation study consisted of submitting a composite of three written 
test instruments to 47 science teachers, and analyzing their teaching 
behaviour using three corresponding observation instruments. Eighteen of the 
21 possible correlations in the Campbell and Fiske matrix achieved ap- 
propriate levels of significance. This result demonstrates the plausibility of 
using the multitrait-multimethod model for validating observation in- 
struments, and establishes a preliminary validity for the construct “View of 
Science.” (Dr. Munby is Associate Professor of Curriculum at the Faculty of 
Education at Queen’s University; Dr. Wilson is Associate Dean of Research, 
Development, and Continuing Education in the same faculty.) 


This investigation is concerned with the validity of classroom observation 
instruments, and with the theoretical significance and practical application 
of the multitrait-multimethod validation model. Questions about the validity 
of all measures in educational research persist for at least two reasons. First, 
it is evident that the link between measured behaviours and our concep- 
tualization of what these show or can be taken to mean is a tenuous one, at 
best, and this is a feature of the social sciences which renders its epistemology 
more problematic than that of the physical sciences. Second, and related to 
the first, in many areas of educational research there is still inadequate heed 
taken of the need for validity of instruments. For example, the prevalence of 
author-constructed instruments of unknown characteristics which are 
employed to tap constructs having unknown conceptual relationships to 
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familiar constructs is a frequently cited complaint in reviews of science 
education research (Trowbridge, 1972; Novak, 1973). Likewise, while there 
tends to be much information available about the reliability of classroom 
observation instruments, there appears to have been substantially less 
attention given to the validity of these devices (Borich & Malitz, 1975). Itis as 
if we can agree to term a specific teaching behaviour “lecturing,” but still not 
be sure that the construct accurately captures the act. 

it is within this context of uncertainty about validity that the study 
reported here is lodged, not with a view to extinguishing the uncertainty but 
to arguing theoretically and instancing empirically that it can begin to be 
addressed. For it will be shown here that a theoretically derived construct, 
measured by a classroom observation scheme, can be validated against 
observation and written measures so that its theoretical relationships with 
other constructs are documented and its characteristics in use are delineated. 
The villain of this piece is the construct “View of Science,” described below; it 
is the purpose of this study to validate this construct using the Campbell and 
Fiske multitrait-multimethod matrix. Before the design and results of this 
study can be discussed it is necessary to explore the elementary epistemology 
that informs the concept of validity in general and the Campbell and Fiske 
matrix in particular. This is necessary for distinguishing a conceptually 
inappropriate use of the model (in a study critiqued later) from a conceptually 
appropriate use (as demonstrated in the validity study reported here). 


A Basic Epistemology for Construct Validation 


The standard account of validity centres on the general notion that the 
validity of a measuring instrument represents the correspondence between 
what in fact the numbers are measures of and what they are thought to be 
measures of. A major implication of this definition is that it makes no sense to 
consider that the validity of an instrument is conceptually distinguishable 
from the validity of a construct. That is to say, there is no way of ever 
knowing for certain what, in fact, numbers generated by social science 
instruments are measures of. To claim otherwise is to subscribe to the 
fallacious assertion that our constructs are not constructs (that is, terms 
invented by us to capture what we think we observe: for example, intelligence, 
memory, anxiety, dogmatism), but are real entities to be found in the 
phenomenal world (could the skull be appropriately pierced). Direct physical 
and sensory contact with our constructs is plainly impossible, so it must be 
owned that the idea of a construct’s validity cannot imply a secure anchoring 
to something independent of human perception and language. Instead, 
validity implies something about the definitional and explanatory usefulness 
of the construct in question. 


From this it follows that the validity of a construct (or of an instrument) is 
incomprehensible and valueless until it is couched in a conceptual context. 
For instance, the validity of an IQ test is meaningless if it is solely a matter of 
correlating its measures with those of another IQ test, because this strategy 
regresses to circularity. Meaning comes to the validity of a construct (such as 
IQ) or a measure (such as an IQ test) when the construct is shown to be 
connected with other constructs in a fashion which permits useful ex- 
planations and testable predictions. 
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Not surprisingly, this view of the validity of a construct permeates the 
physical sciences: Planck’s h and Pauli’s neutrino are valid constructs only 
because they successfully plug conceptual gaps in elaborated theoretical 
structures. Here it is plain that, for meaning, the concept of validity must 
incorporate the place of a construct within a network of constructs which, 
together, bear some relationship to observations. 


Margenau (1950) puts it this way: 


Constructs admissible in science must be multiply connected; they may not be 

insular or peninsular; sets forming an island universe must be excluded. (p. 87) 
Two significant sorts of connections are, for Margenau, formal and epistemic. 
Formal connections are those which incorporate constructs within a network 
of related constructs. Epistemic connections are those which connect 
constructs to observations. Significantly, constructs not so connected are 
inadmissible to the sciences because they are impotent for explanatory and 
predictive purposes. 


These considerations can be applied to the multitrait-multimethod 
validation matrix of Campbell and Fiske. This model incorporates two 
important features. First, it tests for the convergence of a novel construct or 
trait with a similar and known construct, and the divergence of the novel 
construct from a dissimilar but known construct. In the epistemological terms 
developed above, convergence tests for coherence of the novel construct 
within a conceptual network. (A convergence represented by a perfect 
correlation is not expected, of course, for that would imply that the construct 
is identical to rather than convergent with another construct.) Divergence 
assures us that the construct under investigation does not converge with 
everything and is, therefore, a unique construct. In measurement terms, 
obtained divergence permits the assumption that obtained variance is not 
simply the consequence of applying a single measuring technique. 


The second important feature of the Campbell and Fiske matrix is the 
incorporation of more than one method of measurement. In measurement 
terms the tapping of two varieties of behaviour allows one to check the 
variance that may be a function of the tapping of behaviour in a particular 
way. The argument from an epistemological viewpoint is equally compelling; 
the greater the number of different types of epistemic connections 
demonstrated for a construct, the more sound the construct is taken to be. 
This is a particularly important feature of test and construct validation in our 
view, especially when the subject of a validation inquiry is a classroom 
observation instrument. To deny its significance is to assume that a teacher’s 
classroom behaviour is independent of any mental states (beliefs, knowledge, 
myths and so forth) which could be “tapped” by other measures. 


These epistemological and measurement viewpoints lead to the minimal 
Campbell and Fiske matrix given in Figure 1. 


So long as it is understood that “construct” and “trait” are in- 
terchangeable, the conceptual significance of the minimal matrix may be 
described as follows. Suppose that the validity of a test reputed to measure the 
novel construct (Test A) by an observation instrument (Method 1) is in 
question. First, that measure should correlate significantly with Test B, since 
it measures the same trait, using a different method, a written test (Method 2). 
This multimethod axis of the model assures us that any validity score of Test 
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fe le tee Known and Known and 

ie ae Related Construct Unrelated Construct 
rns 
ch 


Figure 1. The minimal components of a Campbell and Fiske matrix. 


A is not an artifact of the method of measurement used. Second, so that we 
can be assured that A and B measure a construct which is not isolated but 
coheres with another construct having a defined conceptual coherence, a 
second construct (known and related) representing a coherent and converging 
construct is identified, and instruments C and D are selected with the intent 
of having A and B scores correlate significantly with those of C and D. Third, 
in order that A and B are measures of a distinctive construct (that is, they 
don’t converge with everything) it is necessary to show that they discriminate 
from other measures, E and F, of a known construct conceptually understood 
to have no relationship to the other two constructs. The theoretically derived 
hypotheses are that measures A, B, C, and D will correlate positively and 
significantly, while measures E and F, while correlating positively and 
significantly with each other, will have insignificant correlations with each 
of A, B, C, and D scores. 


Correlational techniques are naturally appealing for analyzing the data in 
this matrix. Yet, as Kalleberg and Kluegel (1975) have pointed out, the 
predicted zero order correlations provide ambiguous information. Basically, 
these correlations do not include partial correlations (interactions) that may 
exist among the methods and traits measured. The suggestion is that a path 
analytic approach to the analysis of the multitrait-multimethod matrix be 
undertaken on the data collected after a confirmatory factor analysis has 
been performed. While this is not the only approach that could account for 
interactions, it is a sensible one. One of the requirements of the approach 
would seem to be a large number of method x trait cells within the matrix in 
order to reduce error. Many empirical studies, including the one that will be 
reported in this paper, fail to meet this requirement. 


Critique of a Recent Validation Study 


In a recent paper, Borich and Malitz (1975) propose to use the Campbell 
and Fiske model to validate three classroom observation schemes, IAST, 
FAIR and COS, reported by Rosenshine and Furst (1973). The use of the 
multitrait-multimethod matrix in this study presents a bit of a puzzle for the 
following reasons. First, all the instruments used are classroom observation 
instruments, and so the study does not appear capable of tapping behaviours 
through more than one method, except in the sense that the same behaviour 
(a videotape of teaching) is examined using three observation instruments. 
Accordingly, the study uses a monomethod technique. Consequently, 
correlations among measures reported include to some unknown degree, a 
“single method” component independent of the construct being measured. 
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Second, the Campbell and Fiske model, as interpreted in the previous 
section, requires the use of a measure of a construct that is not anticipated to 
correlate with any of the other constructs or measures — it tests the divergent 
validity of the construct in question. This aspect of the model is not 
straightforwardly present in the study for the three observation instruments 
have very similar features. For example, all three systems have a category 
equivalent to “lecture.” Also, all three have a category for coding the 
equivalent of “affective moves.” Obviously, a coder cannot check both 
“lecture” and “affective move” for the same piece of teacher verbiage on a 
single scheme. So, use of one category precludes use of another — the data are 
ipsative. It is erroneous to correlate such data and construe the results as a 
test of validity. Rather, we have here a strenuous test of reliability. This is 
plainly revealed in Figure 2, adapted from the study of Borich and Malitz. 


IAST FAIR 


Accepts B Ge6) 


IAST 


Questions B 


Values A 
FAIR 


Delves B -.14 (. 84) 


Figure 2. Simplified illustration of the validation model used by Borich and Malitz. 


Typically, validation is displayed in the “validity diagonal,” (A-FAIR and 
A-IAST, B-FAIR and B-IAST). Yet it is to be remembered that A and B scores 
are ipsative so correlations within the validity diagonal are reflections of 
coding error and not of convergent or discriminant validity. 


This misinterpretation of the model can explain Borich and Malitz’s 
nonplus that: 

When one pairs categories, one is hypothesizing that the two categories measure 
the same behavior, that is, they will demonstrate convergent validity. But at the 
same time, one is hypothesizing that each of the paired categories differs from 
other categories in its own system and in the other system. In other words, a 
hypothesis about convergent validity necessarily includes a hypothesis about 
discriminant validity. (p. 430) 


Of course! And it is precisely this logical entailment which signals that the 
Campbell and Fiske model is inappropriately used here. 


The last problem is implicit in the above quotation in which the talk is of 
categories measuring behaviour. In fact, categories record counts of 
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behaviour, so when a category is used it is not measuring a construct or a 
portion of one, except when a construct is mounted atop the frequencies, 
conceptually; instead it is recording a behaviour. Since in the Borich and 
Malitz study, constructs and devices purporting to tap them are not in 
evidence, the notion of construct validity simply does not apply, much less 
that of convergent and discriminant validity. 


Convergent and Discriminant Validation of “View of Science” 


It has been the overall argument of this paper thus far that the Campbell 
and Fiske model is especially suited to validating classroom observation 
instruments whose measures allegedly have epistemic connections to 
constructs. It is the purpose of the remainder of this study to instance that 
this is so and, by this, to establish the validity of the construct “View of 
Science.” This construct is derived primarily from Nagel’s descriptions of 
Realism and Instrumentalism (1961). An earlier study (Munby, 1973) 
analyzed these philosophical positions, and reported a classroom observation 
scheme that reliably identified the view of science presented in the teaching 
discourse (Munby, 1974). The purpose of the present study was to validate this 
measure and its construct. Relevant portions of this observation or analytical 
scheme, appearing in Munby, Kitto and Wilson (1976), show the basis for 
selecting other constructs which appear to have formal connections with the 
bipolar construct “View of Science.” The potential significance of this 
construct to science education is discussed elsewhere (Munby, 1976). 


Instrument Selection 


Figure 3 shows the instruments selected for the multitrait-multimethod 
validation study. These selections were based on the following con- 
siderations. 

The “Nature of Science Scale’ (Kimball, 1967) has philosophical bases 
similar to those of the Munby System. (Furthermore, the items can be seen to 
reflect the features of the observation scheme itself.) Conceptually, the 
“Nature of Science Scale” and the Munby System measure the same trait, 
using different methods. That is, itis assumed that a teacher’s view of science 
and his teaching of science are similar, and that “View of Science” is 
epistemically connected both to teaching behaviour and to responses to the 
“Nature of Science Scale.” 


CONSTRUCTS (TRAITS) 


View of Authoritarianism Performance 
Science (convergent) Anxiety 
(divergent) 
Obsextalionh Munby Flanders Behavioral 
System System Checklist 
Written Nature of Califomia-F PRCS 
Science Scale Scale 


Figure 3. Instruments used in the validation study. 


METHODS 
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Analysis of the social implications of Realism and Instrumentalism 
(Munby, 1973) suggests that one whose posture is Realist would evidence 
authoritarianism and dogmatism. For proponents of Realism, science 
provides us with unalterably true statments about the world, and science is 
the only way of obtaining accurate and reliable information about the world 
— this being recognizably similar to Roszak’s “Myth of Objective Con- 
sciousness” (Roszak, 1969) which, for him, underlies society’s posture toward 
science. These and statements characterizing Realism appear to reflect an 
authoritarian and dogmatic view about truth and about science. Instrumen- 
talism, on the other hand, makes softer claims about truth, the nature of 
theories, and the capability of science. So, an Instrumentalist is expected to be 
neither authoritarian nor dogmatic. 


This conceptual cohesion is cemented by Rokeach’s work (1960), in which 
the following are associated with authoritarianism and dogmatism: concrete 
thinking, intolerance of ambiguity, premature closure of perceptual processes. 
Such characteristics are consistent with a Realist view of science, and 
inconsistent with an Instrumentalist view. Accordingly a formal (conceptual) 
connection is posited between View of Science and Dogmatism, though they 
are not identical constructs. 


Dunkin and Biddle (1974, pp. 93-101) trace the origin of the Flanders 
system of classroom interaction to earlier work in which teaching behaviour 
was characterized along the continuum democratic — autocratic. Additional- 
ly, they report a number of studies in which a teacher’s “directedness” 
appears related to scores on dogmatism scales. For these reasons, the 
Flanders system appeared a likely candidate for the convergent observation 
measure. Of the many ratios which can be calculated using the Flanders 
system, the Pupil Initiation Ratio (PIR) was judged closest conceptually to the 
construct dogmatism, “The PIR is proposed to indicate what proportion of 
pupil talk was judged by the observer to be an act of initiation” (Flanders, 
1970, p. 102). 

It was found difficult to identify a distinctively divergent trait having both 
a written measure and an observation measure suitable for classroom 
observation for, by definition, the construct “View of Science’ represents a 
basic philosophical posture toward science and reality and so it can be 
expected to converge somewhat with several attitude and knowledge 
measures. (A search of the literature revealed that there are very few 
constructs for which more than one method of measure is available.) A 
potentially suitable candidate was found in the work of Paul (1966). His work 
on performance anxiety includes two measures: a written Personal Report of 
Confidence as a Speaker (PRCS), and a timed observation checklist for 
performance anxiety in speaking. Theoretically, there is no reason to suspect 
formal connections between performance anxiety and “View of Science’ or 
Dogmatism. 


Given the amount of work involved in coding classroom behaviour and the 
availability of at least three measures of dogmatism, a pilot study was 
designed to test the written method portion of the Campbell and Fiske model 
(Munby, Kitto & Wilson, 1976). An additional “trait’’ of random numbers was 
added to the validation model. The Nature of Science Scale correlated (r= .51, 
p <..001) with the California-F Scale, and the PRCS and Random Numbers 
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had near zero correlations with the Scales, as predicted. Accordingly the 
California-F Scale was selected for the validation study. 


Hypotheses of the Validation Study 


The hypotheses of this multitrait-multimethod study can be represented on 
a validation matrix, as in Figure 4. The validity diagonal is represented by 
AB. Anticipated significant positive correlations in the matrix are 
represented by “+” while near-zero insignificant correlations are represented 
by “NS.” The correlations along the diagonal CD will be measures of 
reliability. A total of 21 correlations is predicted. 


MUNBY PIR PAUL KIMBALL CAL-F PRCS 
O VIEW - MUNBY c + 
B 
S 
E 
R 
y  DOGMATISM - PIR + uy 
A 
y 
I 
) 
y ANXIETY - PAUL NS NS + 
A ar | 

VIEW - KIMBALL + + NS a 
W 
R 
I 
7 DOGMATISM - CAL-F NS + - 
we 
E 
N 

ANXIETY - PRCS an, . | NS NS + 


Figure 4. Hypotheses of the validation study. 


Methodological Considerations 


The scaling of the ‘““Nature of Science Scale” and the PRCS were changed 
from True/False to Likert-type (+3, +2, +1, -1, -2, -3) to correspond to the 
California-F Scale. 


Items from the three written measures were randomized and printed as a 


single composite test to ease administration and to attempt to reduce effects 
of response sets. 


The Munby System was used to judge each 30 seconds of teaching 
discourse as “Very Instrumental’ (2), “Moderately Instrumental’ (1), 
“Intermediate” or “Marginal” (0), “Moderately Realist” (1), “Very Realist” 
(2). A “Realist Ratio” (the total R score divided by the total R and I scores, 
represented as a percentage) can then be computed for each lesson. 
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A computer program was devised to produce Flanders matrices, and to 
compute values of PIR (Kitto, 1976). 


Subjects 


Fifty-five science teachers randomly selected from local schools, agreed to 
participate in the study. Each was asked to provide the time and date of a 
class which would involve teaching, as opposed to testing or conducting 
laboratory work. While participants were informed that the research was to 
validate an instrument, the nature of the variables and instruments was not 
disclosed until the data were collected. 


Each lesson was recorded on audiotape for later analysis by the Flanders 
and Munby Systems. Live observation was used to produce a PRCS score, and 
in 25 of these cases, two observers were present to provide reliability data. 
After a lesson was recorded, the teacher completed the composite test. 


The lessons represented a wide range of subjects and grade levels. Eight 
lessons were removed from the study because they were either too short (less 
than 10 minutes) or they produced no score on the Munby System. (Despite 
the requests, some teachers had scheduled laboratory exercises which 
commenced with managerial matters about performing the exercise.) These 
exclusions left 47 lessons for analysis. 


Data Analysis and Discussion 


The program used for producing the multitrait-multimethod matrix was 
adapted from the SPSS package (Nie, Bent & Hull, 1970). Spearman 
corrections were made for reliability coefficients and the validity coefficients 
were corrected for attenuation, yielding the matrix in Table 1. (Reliability 
values for observation measures should be understood as interobserver 
agreements.) 


Of the 21 correlations in the matrix only three failed to correspond with 
those predicted in Figure 3: PIR and the Munby System did not correlate 
significantly; the written measure of anxiety as a public speaker (PRCS) 
correlated significantly with the California-F Scale, whereas in the pilot 
study (Munby, Kitto & Wilson, 1976) the correlation was .09 (p < .13) and, 
surprisingly, the written and observation measures of anxiety failed to 
correlate, suggesting that while each measure functions to show the 
discriminant validity of the construct “View of Science’, anxiety as a public 
speaker did not meet minimal convergent validity criteria. 


The remaining significant correlations, while in the anticipated direction, 
are not high. Their failure to account for more substantial portions of the 
variance is not too problematical, though, for two reasons. First, it needs to be 
recalled that convergence, in this model, will not approach unity simply 
because this would demonstrate that the converging constructs are the same. 
In other words, the formal connections tested by the multimethod-multitrait 
model are just that — connections. Second, while the assumption that 
teaching behaviour is reflexive of teacher views or beliefs is conceptually 
attractive, it is not very likely that the assumption is thoroughly tested by 
coding one sample of each teacher’s teaching. The revelation that the 
correlations are significant is cause to view the assumption with respect, and 
to put it to a fuller and more exacting test. 
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TABLE 1 
MULTITRAIT-MULTIMETHOD MATRIX (N = 47)? 
eg en ee Lk ee Ce eee 
MUNBY PIR PAUL KIMBALL CAL-F PRCS 
VIEW - MUNBY .97 


.001) 


DOGMATISM - PIR 


Zo et Sl es <  e mrs  O 


ANXIETY - PAUL -96 


: 4 -001) 


VIEW - KIMBALL =o aloe) 


(21) (.001) 


DOGMATISM - CAL-F = il) 


(.10) 


2 MW BwAHnRHH wD =z 


ANXIETY - PRCS ind 


(ez) 


-89 
(.001) 


Dope seer 7 
Significance levels in parentheses 


Two further matrices were produced to investigate the validity of the 
construct “View of Science” at the ends and at the middle areas of the 
distribution of scores on the Munby System, for the middle range was quite 
flat. A high group of fifteen teachers was identified whose “View of Science” 
score exceeded 69 (z > +.43) and a low group whose score fell below 57 
(z <-.43). The matrix of this group, with coefficients corrected as before, 
appears in Table 2. While all values move closer to the anticipated values 
suggested by Figure 3, the anomalies noted above persist. 


Conclusion 


This study has attempted to provide an epistemological interpretation of 
the Campbell and Fiske multitrait-multimethod validation study, and to 
show its theoretical potential for the validation of classroom observation 
schemes. This view of the model was used to critique a similar validation 
study. A validation study on the Munby System and the construct “View of 


Science” was reported as an instance of a more appropriate use of this 
validation technique. 


Ray and Heeler (1975) opine that a single validation study is insufficient 
and that a series ought to be undertaken. The results of the present study 
support that position. Although they are not entirely satisfactory, the results 
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TABLE 2 
MULTITRAIT-MULTIMETHOD MATRIX, HIGH-LOW GROUP (N = 30) 
. MUNBY PIR PAUL KIMBALL CAL-F PRCS 
° VIEW - MUNBY .99 
B (.001) 
Dy 
E & 
R 
V DOGMATISM - PIR 22 .99 
A (onl) (.001) 
T 
I 
) 
N ANXIETY - PAUL -.15 0 .99 
(G22) (.49) (.001) 
VIEW - KIMBALL .76 
(.001) 
W 
R 
I 
T  DOGMATISM - CAL-F 
by 
E 
N 
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ANXIETY - PRCS : 
(.001) 


4 0 3 
(.28) (.49) as C16) 


show that “View of Science” has promise as a verifiable construct. A further 
validation study might investigate the appropriateness of the sampling of 
teaching behaviour and so investigate more thoroughly the assumed 
relationship between teacher behaviour and teacher belief and view, possibly 
using the techniques suggested by Kalleberg and Kluegel (1975). 


It has been suggested above that few classroom observation instruments 
are subjected to systematic validation, and it may be that some are not 
amenable to the procedures reported here. Also notable is the impressive lack 
of unanimous results for studies using such devices. While it remains unwise 
to posit a causal connection between these observations, it does seem plain 
that if research using complex and detailed classroom observation schemes is 
considered worth the effort, it is likely useful to validate these devices in a less 
perfunctory fashion. 
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Anxiety Reduction Correlates of Adult 
Learners: A Longitudinal Study 


This exploratory study attempts to identify: (1) major sources of stimuli in the 
instructional setting that elicit anxiety of adult learners, (2) the changes of 
anxiety level through the entire course period, and (3) potent factors that either 
bring about reduction of anxiety or retain a low level of anxiety. Data from 81 
adult learners attending six different professional education courses in the 
Faculty of Education at Brandon University suggest that there are basically 
three types of anxiety: anxiety about the course setup, anxiety about the 
interpersonal process, and anxiety about the evaluative outcomes. Levels of 
anxiety vary but they all display some degree of decline towards the end of the 
course. Of the potent instructional factors accounting for reduction in anxiety, 
there are a few whose effects are persistent throughout the duration of the 
course, some whose influence on anxiety emerges or re-emerges in different 
periods of course instruction, and still others whose impact on anxiety are 
totally displaced towards the end of the course. Changing needs and 
expectations of adult learners are significantly related to the amount of 
anxiety experiences. (Dr. Lam is Assistant Professor in the Faculty of 
Education at Brandon University, currently on leave as Post-doctoral Fellow 
at the University of California at Los Angeles.) 


A question of continuing interest to all adult educators is how adult 
learners effectively acquire knowledge and skills. While possible answers to 
this question can be sought in many ways, the exploration of some crucial 
cognitive disorganizers seems to be a fruitful approach. The negative effect of 
anxiety, in particular, has attracted the widespread attention of people from 
various disciplines: neuro-physiologists, biochemists, psychiatrists, and 
psychologists. Moreover, there is no lack of theories developed to account for 
the phenomenon regarding anxiety (Fischer, 1970). Surprisingly, however, 
few of these efforts have been translated into specific suggestions for 
educators to follow in fashioning the teaching-learning process. Three 
plausible explanations can be advanced here for speculation. 


1. Consensus is still lacking with respect to the conceptual meaning of 
anxiety. Overconcern for the development of microtheories of anxiety has so 
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far prohibited the much needed communication and integration (Spielberger, 
1966). 

2. Parallel to the theoretical fragmentation of theories are the 
methodological difficulties of measuring emotions which tend to be transitory 
in nature and are often ignored in psychological research (Spielberger, 1966). 

3. Research probing the relationship between anxiety and concept 
learning has produced inconsistent results, and further expansion of scope in 
investigation seems urgently needed to clarify the issue. 


An examination of recent research in the area of anxiety suggests that the 
aforementioned weaknesses are partially overcome: authorities of the various 
disciplines were brought together for dialogue (Spielberger, 1972). The 
emergence and continued development of Spielberger’s anxiety instrument 
making explicit the distinction between “state” and “trait”? anxiety proves 
extremely useful in assessing, with greater accuracy, such an emotional 
impact on learning (Meyers & Martin, 1974). 


Problem 


To adult educators, however, these efforts still leave a wide gap when 
applying findings to practice. There is a growing acceptance that anxiety is 
not simply motivational but frequently a cognitive disorganizer, particularly 
to aged learners (Cattelli, 1961; Schaie, 1968). The main problem initially 
confronting adult educators is the identification of the origins of stimuli 
arising from the actual learning setting, rather than from experimental 
situations (as is frequently reported in psychological research) which trigger 
state and trait anxiety. Spielberger’s distinction between these two types of 
anxiety, while useful in resolving inconsistency of research results, may be 
practically insignificant to those involved in adult education. A major reason 
is that adult learners are not screened on the basis of trait anxiety and the 
amount of anxiety detected must be the end product of interaction between 
the two types. 


Another question relatively unexplored but crucial to adult educators is: 
How does the anxiety level of the learners fluctuate throughout the teaching- 
learning process and what are pertinent factors that bring about reduction of 
their anxiety level? 


In the light of the proposition and some new tentative empirical evidence 
(Lam, 1976) that adult learners’ needs and expectations are not constant but 
undergo varying degrees of change over a span of time, another related 
question to be asked is how do needs and expectation changes affect variation 
in anxiety levels? 


Method 
Sample 


Instructors offering the education extension courses on campus were 
initially approached. To minimize group differences in anxiety arising either 
from exposure to different teaching styles or from different duration of 
courses, two instructors who offered more courses of similar duration were 
eventually chosen from those who consented to take part. Essentially it 
entailed six courses in the areas of Special Education and Educational 
Administration. All of these courses had been scheduled once every week for 
three hours throughout the same semester. 
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Data of this study were compiled from 81 out of 85 adult learners who 
enrolled in these courses. The four excluded from analysis were those who 
dropped out from the courses at a fairly early stage. 


The ages of these learners ranged from 18 to 62 with the mean being 30. 
Female learners outnumbered male learners in a ratio of almost two to one (53 
vs. 28). In terms of occupation, 28 were senior university students taking their 
final year in the education program, 35 were teachers, 8 were school 
administrators, and the remaining 10 had professions that were indirectly 
related to education. 


Instruments 


In line with the foregoing argument that Spielberger’s instrument, while 
adequately dissecting state-trait anxiety, fell short of capturing contextual 
stimuli from which observed state anxiety was elicited and was therefore of 
little pragmatic value to practitioners in adult education, a nine-item 
questionnaire attempting to assess both the level and contextual sources of 
anxiety was constructed and used. These nine items were listed as follows: 

(1) To what extent are you contented with the general course outline? 

(2) To what extent are you contented with the direction that the course is taking up 

to this stage? 

(3) To what extent are you certain that this course meets your needs? 

(4) To what extent are you confident of meeting the course’s demands? 

(5) To what extent do you feel at ease regarding the instructor’s general approach? 

(6) To what extent do you feel at ease interacting with your instructor? 

(7) To what extent do you feel at ease interacting with your classmates? 

(8) To what extent do you feel at ease in sharing your pase experiences with others 

in class? 

(9) To what extent are you confident of your performance in the course so far? 

Each of these items was assessed by five-point scale with point 1 on the 
scale denoting, for instance, ‘very discontented,” “very uneasy,” ‘“‘very 
worried,” ‘very uncertain,” to point 5 indicating “very contented,” ‘very 
easy,” “very relaxed,” “very certain,” and the like. Learners were asked to 
circle a point that best reflected their feelings at that period of time. In view of 
the positive nature of the items, anxiety scores of learners were indirectly 
tabulated by consulting the reverse portion of the scale. For instance, if they 
expressed feelings of ease in interacting with the instructor by circling point 4 
in the scale, their anxiety level in this respect was assumed to have a scale 
value of 1. 

Test-retest reliability of this questionnaire yielded coefficients of .70 
(between the first and second period), .71 (between the second and the third 
period), and .67 (between the first and the third period). These seem to be 
satisfactorily high for the present exploratory study. 

A second set of questionnaires measuring content and structural aspects 
of the teaching-learning process was administered. In total, it consisted of 17 
items developed in a previous study (Lam & Wong, 1974). Reliability of the 
instrument was detected by inter-item coefficients. Values of internal 
consistency were found to range from .54 to .68. Given the relatively short 
instrument, the reliability coefficients are not too low. Basically, content 
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factors assess learners’ cognitive ability to follow the course, the extent to 
which reading materials and assignments enhance their comprehension, and 
their perceived adequacy of course preparation on the part of the instructor. 
Structural factors examine: (a) the effectiveness of the formal! interpersonal 
process (teaching strategies) such as lectures or discussions that enhance 
learners’ comprehension of the courses, (b) the extent of harmony in the 
informal interpersonal process, i.e., approachability of instructor and 
classmate sociability, and (c) the condition of the intrapersonal process, 1.e., 
the extent of satisfaction with the instructor and with the course that 
individual learners experience. 

These two sets of questionnaires were administered concurrently to the 
subjects at the beginning (3rd session), midway point (7th session), and end of 
the course (2nd last session). Such repeated measures attempted to capture 
changes taking place during the three critical points of time in the teaching- 
learning process. 


Additional biographical data like sex and age which were reported by 
Cattell and Scheier to be anxiety-related factors (Cattell, 1966) and 
occupational backgrounds of the subjects which received relatively little 
attention in research were all included in the present analysis. 


Statistical Analysis Procedures 


To identify basic types of anxiety emerging from the nine-item question- 
naire, one method of factor analysis, namely Principal Factoring with 
Iteration using Varimax Orthogonal Rotation, was employed (Kim, 1975). 
The selection of this method was based on such considerations as getting a 
better estimation of communality, and a simplified theoretical scheme of 
underlying dimensions, even though in empirical reality the unrelatedness of 
the underlying factors may not be established. Such an inherent shortcoming 
could be borne in mind in the interpretation of the analysis. 


Stepwise-Multiple Linear Regression Analyses were repeatedly employed 
on three separate occasions to identify significant independent variables 
affecting the types of anxiety at different periods of the courses. Multiple t- 
test comparisons were used to detect differences of anxiety levels of the four 
types of learners undergoing varying degrees of need changes. 


Results 
Types of Learning Anxiety 


The anxiety scores of each period (beginning, midpoint, and end of the 
courses) together with the total mean scores were subjected to Principal 
Factoring Analysis. The pattern of inferred factors emerging from each 
period seemed to concur strongly with the overall pattern generated by 


averaging the responses of each learner over the three periods for the factor 
analysis (Table 1). 


It seems that three underlying types of anxiety can be identified. Factor I 
is accounted for by items assessing stress with respect to course outline, 
direction of the course, accommodation of one’s requirements, and the general 
approach adopted by the instructor. Factor II is accounted for by items 
measuring tension regarding interaction with the instructor and classmates, 
and with experience sharing. Factor III is accounted for by items tapping 
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TABLE 1 


VARIMAX ROTATED FACTOR MATRIX OF ANXIETY VARIABLES — 
COMMUNALITIES AND EIGENVALUES 


Factor I Factor II Factor III 
b Overall 
Period st pnd 3rd eae st gnd 3rd Over- st gnd 3rd Over- Communa- 
Vardahiles a all all lity 
Discontentment 
with course -59 -70 -70 ~ibil -.32 -.35 ~24 18 -.17 -.05 .35 spiBh +63 
outline 
Discontentment 
with course bys) olf 62 84 -.26 -.20 a22 ok 2 =-.17 -.09 ne -08 es 
direction 
Concerns of 
Course - 30 - 36 22 44 -.19 -09 -16 -26 54 neil -80 -65 68 
Assignments 
mncertassty ef 6D. ie Te 78 a0 1.362) Po ok 06 36 7 68 
Megeing lesdr : A 0 : -. A , : 3 -O1 c bal é 
Worry about 
Instructor's 59 ae - 83 -82 te =.35) 37 5745) 14 se) -30 +19 .78 
Approach 
Tension Inter- 
acting with antes: silly 28 19 54 - 56 -83 -90 19 14 oil -02 -85 
Instructor 
Tension Inter- 
acting with -19 -07 mae aeAil 63 - 84 81 -81 706 =.03 ev ai HS) (ihe) 
Classmates 
Degree of 
Reluctance in ak Bel) 23 18 w3 83 . 87 84 edi = 18 tS =.04 74 
Experience Sharing 
Lack of Confi- 
dence in alls! - 38 20 . 30 .25 - 06 nical 33 eS Bey) 37/6 -70 59 
Performance 
Eigenvalue 4.40 1.41 ap) 
% of Variance 67.4 7a N55) ifieal 


learners’ concern regarding their ability to cope with course demands and 
their confidence in performance. In view of the nature of these items, it seems 
most logical to equate factor I to anxiety about course setup, factor II to 
anxiety about interpersonal process and factor III to anxiety about evaluative 
outcomes. 


Further examination of these anxiety variables shows a high degree of 
communality which indicates a high degree of homogeneity of items 
assessing anxiety. In terms of variance accounted for by the three emergent 
factors, it seems that anxiety about the course setup explained a greater 
proportion of overall anxiety felt by the learners than did anxiety about the 
interpersonal process and evaluative outcomes (Table 1). 


A crucial concern of lumping the anxiety data of the six groups of learners 
for factor analysis is that it may obscure block characteristics. However, in 
view of the fact that only two instructors were involved, block variations 
arising from different teaching strategies and related contextual factors were 
largely controlled. Supplementary ANOVA analysis detecting group varia- 
tion levels of anxiety for the three identified types at three different points of 
time under the two instructors indicated that no statistically significant 
differences existed (Table 2). This points to the relative unimportance of 
between-group variations as compared with within-group difference in 
anxiety level. 

Further, by plotting the composite scale scores of each type of anxiety over 
the three points of time, it seems that learners from these six courses on the 
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TABLE 2 
SUMMARY OF ONE-WAY ANOVA OF GROUP DIFFERENCES IN ANXIETY 
LEVELS UNDER TWO INSTRUCTORS OVER 
THREE PERIODS OF TIME 


‘ Source of Mean 
Ss Variation ae Square 3 
lst Period 
Anxiety I Between Groups 1 oll EAT Bs. (ONG Sic) 
Within Groups 79 aL S56 
Anxiety II Between Groups il de 220 PD Shey (nha So )) 
Within Groups TS) -4764 
Anxiety III Between Groups al 8595 2.98 (N.S.) 
Within Groups 79 ~4559 
2nd Period 
Anxiety I Between Groups al - 8500 2 aS OI GNinSep) 
Within Groups 79 - 3608 
Anxiety II Between Groups 1 Ihe ALG. DSO. (Ogio) 
Within Groups 79 - 4033 
Anxiety III Between Groups il n PMS) 5AS) (Wa Sion) 
Within Groups i) -4893 
3rd Period 
Anxiety I Between Groups al i a levsi0)s! 3246" (Nese 
Within Groups 79 -9349 
Anxiety II Between Groups itt .9973 Sao Nase) 
Within Groups 79 6423 
Anxiety III Between Groups 1 .9670 WEASY GSE) 
Within Groups 79: -6650 


whole experienced a relatively low level of anxiety, with anxiety about 
evaluative outcomes persistently higher than anxiety about course setup or 
anxiety about interpersonal process. In general, however, the three types of 
anxiety do exhibit some decline towards the end of the courses and such a 
decline is more obvious in the case of anxiety about the interpersonal process. 


Changing Importance of Anxiety-Related Factors 


To determine factors in the teaching-learning process as well as personal 
variables that account for reduced anxiety, nine stepwise-multiple linear 
regression analyses were executed. The reduced anxiety scores were derived 
from the composite scores tabulated by summing up the averages of scale 
scores defined by the previous factor analysis. Thus, the reduced anxiety 
score for course setup was generated by adding the individual mean scale 
score of items 1, 2, 4, 5 of the Anxiety Questionnaire, the reduced anxiety score 
for interpersonal process, from the individual mean scale score of items 6, 7, 8 
of the said questionnaire, and the reduced anxiety score for evaluative 
outcomes from the individual mean scale score of items 3 and 9. The order of 
entry of course content, structural and personal factors as independent 
variables, was determined by the size of the variance accounting for reduced 
anxiety scores. All the 17 variables were included in the nine analyses. For 
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brevity, only those variables found to attain significance in beta coefficients 
as well as F values were summarized in Tables 3, 4, 5, and 6. 


If attention is focused on the appearance and disappearance of variables 
statistically significant in accounting for the three types of anxiety at three 
different periods, some very interesting observations can be made. Each type 
of anxiety, for instance, has its persistent factors (i.e., variables that are 
important throughout the whole process), emergent or re-emergent factors 
(variables that appeared in any two periods), and displaced factors (variables 
that appeared once and were replaced). 


(i) Persistent Factors. In the case of Anxiety I (anxiety about the course 
setup) the persistent factors were “understanding” and “expectation 
change.” These imply that increased understanding of the course content and 


TABLE 3 


STEPWISE MULTIPLE LINEAR REGRESSION ANALYSES ASSESSING 
THE RELATIVE IMPORTANCE OF CONTENT, STRUCTURAL AND 
PERSONAL FACTORS TO ANXIETY REDUCTION OF COURSE SETUP IN 
THREE DIFFERENT PERIODS 


Significant Anxiety Reduction of Course Setup 
Predictors 
ie Period gud Period qa Period 
Bo R® dé /dé, F B R? df, /df> PF B R2 df /dfo P 

omnes 169 23 1/80 13.48 sD 9 SRI 4/77 Ae ASE W513 2/79 8.74 
Understanding 
Eepect ation ew (ay 2/79 6 CSM E mms DiS Wy SWE) ae 4/77 Tene 
Changes 

' 
Pastepctor (2, hits waGE 3/78 54 2 = 
Approachability 
ladles. hogs - 235" 33 1/80 6.02: - 
Satisfaction 
Sex = 22 eo 3/78 RAO yay sss 3/78 4.70 
Age ht sof 3 5/76 2.35 


ea NTE: predictors recorded here have beta coefficients and / values significant at .05 level or better. 


TABLE 4 


STEPWISE MULTIPLE LINEAR REGRESSION ANALYSES ASSESSING 
THE RELATIVE IMPORTANCE OF CONTENT, STRUCTURAL AND 
PERSONAL FACTORS TO ANXIETY REDUCTION OF INTERPERSONAL 
PROCESS IN THREE DIFFERENT PERIODS 


Significant Anxiety Reduction of Interpersonal Process 
Predictors 
18" period a perdad 37 op eriod 

B R2 df /dfp FP B R? df ,/df£> F B R2 = df, /dfo F 
Instructor's cme 2/79 8.46 sei Seyi 1/80 3.93 ie Gy 2/79 8.25 
Approachability 
Age Sie 4736 3/78 8.48 ,205 6 a26 2/79 3.30 ee ci 3/78 5.66 
plan anate 298.20 1/80 5.21 = .34 ©.38 1/80 10.46 
Sociability 
Lecturing 44.43 4/77 10.57 = 520 9,63 6/76 Beg 
Grade ere 5 5/76 2.28 = 
Course Z Sve Pach 5/77 2.53 


Satisfaction 


Fa Atal predictors recorded here have beta coefficients and F values significant at -05 level or better, 
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TABLE 5 


STEPWISE MULTIPLE LINEAR REGRESSION ANALYSES ASSESSING THE 
RELATIVE IMPORTANCE OF CONTENT, STRUCTURAL AND PERSONAL 
FACTORS TO ANXIETY REDUCTION OF EVALUATIVE OUTCOMES IN 
THREE DIFFERENT PERIODS 


Significant Anxiety Reduction of Evaluative Outcome 

a 
Predictors : 

ies Period oo Period 30 Period 
B Ro  df)/dé> F B R2 df, /dfo P B R2 df, /dfo PF 

Grade .24 -08 1/80 4.58 
Expeetation a J30%s. 20 BIOL, WI eOS eps ess e PLs 2h797 9 Ge.49 186 tetas 3/78 12.69 
Changes 
eae 56 eae 2/79 5.13 - 22" 40 4/77 3.84 
Materials 
Soe pane oR ket 6/75 2.41 : 2 50 6/75 3.64 
Sociability 
Discussion wee 38 7/74 papal - - 
Occupation 23 By 7/74 6.17 
Lecturing -38 .29 4/77 5.95 - - 
peter ees : 63.19 a/789) 6052 45° 443) OG79e 5.47 
With Instructor 
Understanding - a -48 24 1/80 9.20 


Sex = = a22 «33 Vk) 3.82 
Age = = a2d -56 8/73 4.39 


Course 
Satisfaction 


7a predictors recorded here have beta coefficients and F values significant at .05 level or better. 

great consistency of learners’ expectations with those of the instructors were 
both significant in lowering the level of learners’ anxiety about the general 
arrangement of the courses. In the case of Anxiety II (anxiety about the 
interpersonal process) the degree of perceived approachability of the 
instructors and age of the learners are persistent factors. Evidently, 
friendliness and acceptance by the instructor reduced the tension of 
interaction in the classroom setting. And, within the age range of the sample, 
the older the learners, the more relaxed they felt towards the learning process. 
Richer working and life experiences that accompany age seniority seem to 
allow older learners to contribute more in formal discussion and to be more at 
ease in interacting with others in these professional courses. 


In the case of Anxiety III (anxiety about evaluative outcomes), it was 
found that once again little need and expectation changes on the part of 
learners had a persistent influence. Most probably, little modification and 
readjustment of one’s initial needs or expectations keep the anxiety level 
about evaluative outome low, though on the whole adult learners still cherish 
some degree of apprehension about being evaluated. 


(ii) Emergent/Re-emergent Factors. With respect to those emergent or re- 
emergent factors, it was found that sex accounted significantly for anxiety 
about the setup in the second and last period of the evaluative process. 
Female learners in general maintained a lower level of anxiety than the male 
as the learning process continued, though such feeling was not marked for 
both groups of learners in the beginning. 


Classmate sociability, lecturing, and grade achieved through course 
performance were re-emergent factors for reduced anxiety about the 
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interpersonal process. Apparently, the immediate acceptance by classmates, 
the perceived understanding of the course through lecturing and some initial 
formal feedback about learners’ performance in the course which were of 
great importance in reducing learners’ initial anxiety about the interpersonal 
process become once again vital factors towards the end. 


Emergent factors for reduced anxiety about evaluative outcomes include 
grade, reading materials, classmate sociability, occupation, and satisfaction 
with the instructors. The importance of formal feedback about learners’ 
performance in the course in relation to reduced anxiety about evaluative 
outcomes in the second period of the learning process, and its disappearance 
in the last period of the learning process seem to suggest that once adults are 
assured of their ability to perform well, their self-confidence is reestablished. 
It was also found that this did not require continued encouragement from the 
instructor in the last critical stage of learning. On the other hand, the 
perceived understanding of the course content through assigned reading 
materials, and the perceived acceptance by fellow learners were vital in 
reducing anxiety about evaluative outcome. In line with the previous 
observation, this seems to suggest that the sources of adult self-confidence in 
performance had shifted from the formal assessment to their own perceived 
abilities and rapport with classmates. 


Satisfaction with instructor, developed in the second stage of the teaching- 
learning process, continued to exert its anxiety-reduction effect in the last 
part. This lends support to the assertion that intrinsic contentment with the 
instructors as individuals serves as a constant check in minimizing anxiety 
about course evaluation. That occupation of the learners should become an 
important factor in anxiety about outcome in the last period of the learning 
process suggests a variation of priorities or values arising from the learners’ 
career background. To learners assuming teaching or administrative 
capacities, course-taking is one of the many ways of professional develop- 
ment. The performance in the course is, therefore, not as much a concern to 
them as to university seniors taking courses for degree and career prepara- 
tion. 

(iii) Displaced Factors. Turning to the displaced factors of the three types 
of anxiety, it was found that instructor approachability, important in 
reducing anxiety about setup in the beginning period, gave way to course 
satisfaction and to the increased understanding of the course through 
discussion midway through the course. It would seem that adult learners’ 
initial need to be accepted by the instructors is no longer their major concern 
as the course develops. 

In the case of reduction of anxiety about the interpersonal process, grade, 
important in the initial stage, was displaced by such new factors as course 
satisfaction and comprehension of course content through reading material. 
This implies that the initial attempts to impress instructors through 
performance in the course became less a concern to adult learners than a 
development in the genuine delight with and understanding of the course. 


Factors accounting for reduction of anxiety about evaluative outcomes 
such as understanding through discussion and lecturing, which were 
prominent in the first phase of learning, were replaced subsequently by 
course satisfaction, sex, age, and general understanding of the course. This 
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TABLE 6 
SUMMARY OF FACTORS RESPONSIBLE FOR THREE KINDS OF 
ANXIETY REDUCTION AT DIFFERENT PERIODS OF OBSERVATION 
(ALL VARIABLES ATTAINING SIGNIFICANT F VALUES) 


Anxiety I (Setup) Anxiety II (Process) Anxiety III (Performance) 
Period lst 2nd 3rd 1st 2nd 3rd lst 2nd 3rd 
Under- Under- Under- Instruc- Instruc-— Instruc- Grade Grade Under- 
standing standing standing tor's tors tor's standing 
approach- approach- approach- 
ability ability ability 
Expecta- Expecta- Expecta- Age Age Age Expecta- Expecta- Sex 
tion tion tion tion tion 
change change change change change 
Instruc- Sex Sex Classmate Classmate Reading Satisfac- Expecta— 
tor's sociabi- sociabi- Materials tion with tion 
approach- lity ity instructor change 
ability 
Course Age Lecturing Classmate Reading 
satisfac- sociabi- materials 
tion lity 
Discus- Classmate Grade Sex Discus- Satisfac- Satisfac- 
sion sociabi- sion tion with tion with 
Laity instructor instructor 
Course Occupa- Classmate 
satisfac- tion sociabi- 
tion wiry 
Lecturing Occupa- 
tion 
Age 


Course 
satisfac- 
tion 


suggests that the adjustment to the style of formal process no longer becomes 
a problem of concern with the passage of time. Rather, the last phase of worry 
and tension about outcomes arises mostly from the characteristics of the 
learners — age, sex, cognitive ability, and inner satisfaction. 


(iv) Factors Responsible for all Types of Anxiety. In search of common 
factors that at one stage or another affect the three types of anxiety, it was 
found that characteristics of the learners, sex, age, and the perceived 
sociability of fellow learners, were the underlying persistent variables. 
Reference to the sample of the study suggests that within the age range of 18 
to 62, the older the learners, the less tension or stress they felt toward the 
general process. In addition, the female learners expressed a slightly higher 
level of satisfaction and less anxiety in all phases of the learning, particularly 
the general setup in the last stage (r = .25, p < .05). Perceived acceptance by 
and sociability of fellow-learners played a significant role in anxiety 
reduction in all stages and this was particularly evident in the last period 
(correlation coefficient with the three types of anxiety reduction were .39, .61, 
and .29; all were significant at .01). 


This tentative dissection of anxiety correlates of adult learners reveals 
that structural and personal factors act jointly with course content in 
bringing about anxiety. In terms of the number of factors involved, anxiety 
about evaluative outcomes is far more complicated than either anxiety about 
the setup or about the interpersonal process. 
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Need Changes and Anxiety Reduction 


In the light of the findings and discussion in the previous section that 
degrees of “changes in needs and expectations” were related to different 
levels of anxiety about the setup and about the evaluative outcome, further 
examination of the relationship between the process of need changes and 
anxiety reduction in all phases of the teaching-learning processes was made. 
Through multiple t-test comparisons, the amount of anxiety reduction 
experienced by the sampled learners was analyzed, with learners categorized 
into four groups as in a previous study (Lam, 1976), ie., in terms of their 
initial similarity/ dissimilarity of needs with instructors and their ultimate 
professed great/small need changes. 


Significant differences in the amount of anxiety about course setup were 
found between Group II (dissimilar initial needs and great final expectation 
changes, N= 32) and Group I (dissimilar initial needs and small final changes 
of expectation, N = 10) and also between Group II and Group IV (similar 
initial needs and small expectation changes, N = 28) in the initial stage of 
learning process (t values were 2.14 and 2.52 respectively, Xu = 2.45, X1 = 2.10, 
Xm = 2.15, p< .05 and above). These reflect that those who cherished a set of 
expectations different from the instructors and subject more readily to 
changes through the teaching-learning process had a higher initial tension 
than those who embraced dissimilar expectations and were not too receptive 
to new stimuli in the teaching-learning process. The former group’s anxiety 
level was also considerably higher than the group having similar expec- 
tations with the instructors and experiencing little adjustment in their 
expectation. On the other hand, the amount of anxiety expressed by Group II 
was not significantly different from Group IV who shared similar expec- 
tations with the instructors and underwent great changes of expectations in 
the process, though this last group also displayed (in terms of anxiety means) 
a higher anxiety level than Group I and Group III (similar initial needs and 
great final expectation changes, N = 11). 

Midway through the teaching-learning process, the amount of anxiety 
level regarding the setup was again detected to be significantly higher in 
Groups II and IV than I and III (t value between Groups II and I is 2.04, X17 = 
2.35, X1 = 2.06, p < .05; t value between Group IV and III is 2.77, Xrv = 1.94, 
Xm = 1.54, p < .01). Apparently those who experience greater changes of 
expectation in the process still cherish a higher level of anxiety about the 
setup than those who undergo smaller changes. 

The amount of anxiety level, in other aspects at all three stages, was found 
to be insignificant among the four groups. 

Tentatively, such evidence suggests that changes of expectations elicited 
some amount of anxiety about the learning setup, and that anxiety level 
apparent in the first two periods began to level off rapidly towards the end of 
the courses so that essentially there were no significant differences among 
the four types of learners identified towards the end of the courses. 


Conclusion 


The present study has added to our understanding of adult learners’ anxiety 
states as explained in the context of other research (Meyers & Martin, 1974; 
Spielberger, 1966), and about their stimuli in the teaching-learning process. 
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Inherent in the teaching-learning situation are three types of anxiety: anxiety 
about course setup, about interpersonal process, and about evaluative 
outcomes. While course setup tends to constitute the predominant source of 
anxiety, adult learners seem most susceptible to anxiety concerning their 
learning outcomes. Stimuli of these three types of anxiety vary considerably 
and reside in many aspects. 

From the perspective of duration and stages of influence, anxiety 
reduction factors fall conveniently into “persistent,” ‘emergent or re- 
emergent,” and “displaced” categories. Both personal factors, age, and 
changing needs and expectations, as well as variables operationalized from 
the paradigm of Bergevin and his associates (Lam & Wong, 1974), for 
instance, course comprehension, instructor’s approachability, classmate 
sociability, course satisfaction, etc., turn out to be important factors in 
explaining adult learners’ anxiety. These findings are not inconsistent with 
those of Boshier (1973) using a “Congruence” model developed from self- 
theory. 


Relationship between need changes and anxiety reduction reveals that 
those who initially had expectations different from those of the instructors 
generally had a higher level of anxiety about the setup. This still holds 
true for those who were undergoing great expectation transformation midway 
through the teaching-learning process. Variation in the level of all types of 
anxiety, however, disappeared for all groups when the end of the courses 
approached. 


Of immediate implication to adult learners, it seems, is the recognition 
that tailoring course materials to the capability of learners and establishing 
good human relations between instructors and learners, as well as among 
learners themselves, are two crucial aspects of anxiety reduction. Given that 
those undergoing greater changes in needs and expectations experience 
greater anxiety about course setup in the first two stages of the learning 
process, it is perhaps pertinent for educators to assist learners to develop more 
realistic educational objectives at the outset so as to reduce unnecessary 
tension in the subsequent learning process. 
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Skills Underlying Coloured 
Progressive Matrices 


Raven’s Coloured Progressive Matrices and a battery of ability tests were 
administered to a sample of 104 grade four boys. The relationships between two 
previously identified subscales of the Raven’s and the ability tests and factors 
were investigated. Even though previous research had identified one of the 
Raven’s scales as a Reasoning by Analogy scale, the present results 
demonstrate moderately strong relationship (r = .42) between this scale and 
spatial ability. This relationship is similar to that between the other Raven’s 
scale, pattern completion, and spatial ability. Results are interpreted to 
indicate that, although subjects could employ several strategies in solving the 
analogy items, those in the present study used primarily a spatial strategy and 
to a lesser extent, reasoning; in this regard the analogy subscale was not 
different from the other subscale. The possible implications of these strategies 
for test score interpretation and future research are discussed. (Dr. Kirby is 
with the Faculty of Education at the University of Newcastle in Australia; Dr. 
Das is Director of the Centre for the Study of Mental Retardation at the 
University of Alberta.) 


Raven’s Standard Progressive Matrices and Coloured Progressive 
Matrices have become popular tests of intellectual capacity. The standard 
version is suitable for adults and adolescents whereas the coloured version is 
meant for use with children and the elderly. The Progressive Matrices were 
originally proposed in 1936 by Penrose and Raven as a series of “perceptual 
tests” in the British Journal of Medical Psychology (cited in Cattell, 1940). 
Since then the Matrices have been regarded as a test of reasoning or Level II 
ability (Jensen, 1970). It has been considered by some (Cattell, 1971) as a 
prototype of a culturefair measure of intelligence, independent of acquired 
knowledge and verbal skills. Other researchers stress that it is a test which 
taps a complex set of processes, some or all of which may be influenced by 
acquired skills (Vernon, 1950; MacArthur, 1975; Das, Kirby & Jarman, 1975). 
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The Coloured Progressive Matrices (sets A, Ab, B) (Raven, 1956) has 
recently been factor analyzed by Corman and Budoff (1974a, 1974b) in a 
variety of populations in order to discover which subscales, and perhaps by 
inference which subskills, underlie the test. In their research, they have 
demonstrated the existence of four subscales or problem types; Discrete 
Pattern Completion, Simple Continuous Pattern Completions, Continuity 
and Reconstruction of Simple and Complex Structures (CRS), and Reasoning 
by Analogy (RA). The first three of these scales involve essentially pattern 
completion, while the fourth consists of a number of nonverbal analogy items. 
The test was factorially invariant across ability and cultural groups. It is 
interesting to note that their empirical classification of problem types 
corresponds closely with one developed by Egorova. 


Egorova, working at the Institute of Defectology in Moscow, has analyzed 
the skills required for solving the Coloured Progressive Matrices based on her 
observations of children’s performance. In her classification, the following 
skills are recognized: identity in simple and complex drawings, symmetry in 
axes, analogy, and miscellaneous skills. While hers is not identical with that 
of Corman and Budoff, all of Egorova’s symmetry in axes items are included 
in the CRS scale, and the analogy (RA) scales are identical if two items 
classed as miscellaneous are included. There is reasonable agreement 
between Egorova and Corman and Budoff on the scales underlying the 
Coloured Matrices. 

The purpose of the present study was to examine the relationships of two 
of these subscales (CRS, RA) with a variety of ability tests and factors. These 
relationships would in turn indicate the nature of the skills and processes 
needed for success on these subscales. More specifically, it was anticipated 
that CRS would be primarily related to tests of spatial ability, and RA to tests 
of inductive reasoning. 


Method 
Subjects 


The subjects were 104 normal grade 4 males from urban schools in 
Edmonton, Canada. Ages ranged from 102 to 127 months, with a mean of 110 
months and SD of 4.9 months. 


Tests and Procedure 

The following seven tests were administered to all subjects in class size 
groups. 
1. Raven’s Coloured Progressive Matrices (Raven, 1965) 
2. Figure Grouping. This is a test of reasoning adapted from the Science 
Research Associates (SRA) Primary Mental Abilities Kit (Thurstone & 
Thurstone, 1962). The subject is required to select which of four figures is 
unlike the others and indicate his answer on an IBM answer sheet. 
3. Word Grouping. Another reasoning test adapted from the SRA kit, this 
test requires the subject to:select which of four words is unlike the others and 
indicate his answer on an IBM answer sheet. 
4, First and Last Names. This test was selected from the French, Ekstrom, 
and Price (1963) Kit of Reference Tests for Cognitive Factors. It is a test of 
associative memory which requires the subject to study a list of 15 first names 
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paired with last names for three minutes. Subjects then have two minutes to 
write as many of the first names beside the appropriate last names which 
appear on a second sheet of paper in scrambled order. 


5. Word-Number. This test is another test of associative memory from the 
French et al., (1963) kit. Subjects study a list of 15 numbers paired with words 
for three minutes and then have two minutes to write as many of the numbers 
beside the appropriate words which appear on a second sheet of paper in 
scrambled order. 


6. Spatial Relations. This test of spatial ability is adapted from the SRA kit 
and requires the child to select which of four shapes, when added to a given 
shape, will form a square. Answers are placed on an IBM answer sheet. 


7. Card Rotations. This test of spatial ability is adapted from the French et 
al., kit. Subjects are required to indicate whether given figures have been 
rotated within the same plane or flipped. Answers are placed on an IBM 
sheet. Because of the difficulty of this task (112 problems in four minutes), 
guessing is penalized by subtracting the number incorrect from the number 
correct. 


Results 


Correct responses for the two subscales of the Raven’s were totalled. The 
actual items chosen for each scale were selected on the basis of the work of 
Corman and Budoff (1974a, 1974b) and Egorova.! The items composing the 
CRS scale were: Ab4, Ab5, Ab6, Ab7, Ab8, Ab9, Ab10, Ab11, B3, B4, B5, B6, 
and B7. The item composing the RA scale were Ab12, B8, B9, B10, B11, and 
B12. The other two scales of Corman and Budoff were not examined due to 
insufficient variance in this sample. KR-20 reliabilities for the CRS and RA 
scales were, respectively, .691 and .777. The remaining six tests were 
submitted to a principal components analysis, and three factors were 
extracted and rotated to an oblique Promax criterion. This analysis may be 
seen in Table 1. The factors were identified as Spatial, Memory and 
Reasoning. Correlations were then computed between the two Raven’s 
subscales, the Raven’s total score (out of 36), and the six ability tests and their 
three factors. These correlations may be seen in Table 2. 


Discussion 


It can be seen from Table 2 that both Raven’s scales are related to both 
spatial and reasoning ability, and that the pattern of correlations are the 
same for each subscale. The correlation between the two subscales was .482. 
Thus, contrary to expectations, CRS and RA were not differentially related to 
spatial and reasoning abilities. These results would seem to indicate that the 
processes or skills underlying performance on these subscales are similar. 
Correlations cannot be conclusive, but the lack of a strong relationship 
between the RA scale and the reasoning factor suggests that these problems 
were not predominantly solved by the present sample of children with an 
inductive reasoning strategy. Rather, it would seem that they relied more 
upon a process akin to spatial ability. The fact that the two scales emerged 
separately in previous factor analyses (Corman and Budoff, 1974a, 1974b), 
and that the KR-20 reliabilities of these scales were high in the present study, 
suggests that the scales do contain different problem types. The present 
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TABLE 1 


PRINCIPAL COMPONENTS ANALYSIS WITH PROMAX ROTATION OF 
ABILITIES BATTERY (N = 104) 


Factor 
Variable ——— h2 
Spatial Memory Reasoning 

Figure Grouping 360 -047 488 541 
Word Grouping -126 -054 1.000 847 
First-Last Names -028 US 282 704 
Word-Number 031 940 -192 815 
Spatial Relations 949 018 => 801 
Card Rotations 873 004 -023 742 
Variance Ser Aeys! 1.388 1.298 4.449 
% of Total Variance 29.4 Paes 26 714.2 
Correlations between factors: 

Spatial 1.000 

Memory . 308 1.000 

Reasoning - 560 5 Swe} 1.000 

TABLE 2 


CORRELATIONS OF TWO RAVEN’S SUBSCALES WITH ABILITY TESTS AND 
FACTORS (N = 104) 


Raven's Scale 
Total Score 


CRS RA 
Ability Test 
Figure Grouping oO) SSsoul 7355 
Word Grouping - 206 278 <297 
First and Last Names aval eos 520) 
Word-Number All 25) AOL SOS 
Spatial Relations - 430 401 508 
Card Rotations Ss iall BLS) ~402 
Ability Factor 
Spatial 437 423 4 Sys) 
Memory .220 p22 “sR sy) 
Reasoning SAM 303 . 386 


# + (.01, two-tailed) = .254 


All differences between correlations (CRS-RA) nonsignificant. 


results show that both problem types are approached with similar strategies, 
relying upon similar abilities. 

The possibility exists that the RA scale requires a form of reasoning not 
represented by the inductive reasoning factor. While undoubtedly different to 
some extent, both require the discovery of some sort of rule. If the RA scale is 
more reliant upon reasoning than CRS is, the correlations between these 
scales and the reasoning factor should have been more discrepant. The lack of 
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discrepant correlations involving the CRS and RA scales and the reasoning 
factor makes it difficult to identify the RA scale as, exclusively, a reasoning 
scale. 


Reasoning is involved in the solution of the two problem types. For both 
CRS and RA scales, correlations with the reasoning tests and factor are in the 
neighbourhood of .3. This shows that even though the inducing of a ruleis not 
the most important determinant of a correct solution, it is involved, at least 
with some children. In particular, the (nonsignificant) difference in 
correlations with CRS and RA and the verbal reasoning test of word grouping 
suggests that a verbal reasoning strategy might be approaching importance. 
A view of the results from this perspective would suggest that performance on 
these two scales of the Raven’s is not independent of verbal acquired skills. 
One may speculate that most subjects employ the spatial nonverbal strategy, 
while some, perhaps those that are most successful, employ an increasingly 
verbal strategy. This latter point has been made by several researchers (e.g., 
Bridgeman & Buttram, 1975) to explain the poor performance of some groups 
on so-called nonverbal reasoning tests: verbal strategies can be very 
successful in these tasks, but are not always used spontaneously. 


The importance of the present study does not involve the determination, in 
any absolute sense, of the nature of the two Raven’s subscales. They seem to 
reflect primarily two different problem types. Rather, the importance is in the 
description of these two scales in terms of the processes or skills which 
underlie them. Raven’s (1965) claim that performance on the test, and in 
particular on the two subscales, is independent of learned skills, especially 
verbal ones, is true for the present sample of nine-year-old boys, who 
performed that test without any special training or instructions. It is 
anticipated, however, that cognitive development would result in increased 
use of verbal strategies, as would training or instructions (Bridgeman & 
Buttram, 1975; Guinagh, 1971). Differences in cultural background (at- 
tributable to both genetic and environmental causes) are also likely to 
produce different approaches to the test problems (see general discussion of 
cross-cultural differences in strategies in Das, Kirby & Jarman, 1975). 


These results also offer support for a position taken by Hunt (1975). In 
devising computer algorithms to solve more advanced Raven’s Matrices 
problems, he discovered two possible strategies: a Gestalt algorithm (similar 
in nature to the passive, spatially-reliant one described above) and an 
analytic algorithm (similar to our conscious, rule-inducing one). Hunt 
concluded that the Matrices test could not be viewed as a measure of 
intelligence, but rather as an indication of cognitive style. It should be noted 
that he also makes the fascinating speculation (p. 155), that the first strategy 
or algorithm is evidence of, in Piagetian terms, concrete operations, and the 
second of formal operations. 


The present results should not be interpreted as either favouring or 
opposing the use of Raven’s Matrices. The test has been shown to be useful, 
and will likely continue to be so. These results do, however, advise caution in 
referring to the test as a test of “reasoning”’; the ability tapped would seem in 
general to be a combination of spatial and reasoning abilities. Furthermore, it 
would seem that the interpretation given to a test score could well depend 
upon the strategy or processes which the subject uses to solve the problems. 
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These strategies are learnable; as such, they encourage viewing test results as 
indicative to a certain extent of what needs to be learned. Clearly, more work 
is required to specify the nature of these strategies, how they can be taught, 
and how they are related to general intelligence. 


Note 
1.Egorova. Personal communication to J. P. Das, November, 1975. 


This study was partly supported by Canada Council and Killam grants to the first author and Canada Council 
Grant S-730056 to the second author. 
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Cross-modal and Intramodal Matching: 
Relationships to Simultaneous and 

Successive Syntheses and Levels of Performance 
Among Three Intelligence Groups 


Cross-modal and intramodal matching ability with visual and auditory 
content was studied in low, normal, and high IQ fourth-grade children. The 80 
children in each IQ group were also administered tests of simultaneous and 
successive syntheses. Results suggested that simultaneous and successive 
central processes are involved in modality matching performance. It was 
further suggested that intellectual competence is not uniquely associated with 
cross-modal as opposed to intramodal matching ability, but instead is reflected 
in an interaction between strategic behaviour and task demands. (Dr. Jarman 
is in the Department of Educational Psychology at the University of British 
Columbia.) 


In recent years, there has been an increase in research on the role of 
sensory systems in children’s information processing (Chalfant & Scheffelin, 
1969; Freides, 1974). A common paradigm used in the study of sensory 
systems is modality matching (Ettlinger, 1967; O’Connor & Hermelin, 1971). 
A stimulus pattern is presented in one modality, followed by a comparison 
pattern in a second modality, and the subject is required to judge the 
equivalence of the two patterns. If the two patterns are presented in different 
modalities the task is designated as cross-modal, and if the same modality is 
used for both, the task is known as intramodal. Thus, for the auditory and 
visual modalities as in the present study, two combinations of intramodal 
matching (A-A and V-V) and two combinations of cross-modal matching (A-V 
and V-A) are possible in the design of tasks. 

The first question of this study is the extent to which sensory systems, as 
opposed to central processes, account for ability to match information cross- 
modally and intramodally. This question has received little direct attention 
despite the large body of research involving the use of tests of matching 
ability. Freides (1974) notes that the theoretical assumptions of these studies 
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may be characterized on a continuum, with the extremes designated as 
“modal-specific” and “nonmodal.” The modal-specific point of view places 
the emphasis on sensory systems in accounting for individual differences in 
matching ability, and the nonmodal position assumes that central processes 
account for performance in these tasks. 


Current information processing models which do not rely on the modality 
matching paradigm for their empirical support generally reflect the non- 
modal view of sensory systems. An example of these models is simultaneous 
and successive syntheses as proposed recently by Das, Kirby and Jarman 
(1975, in press). In brief, Das et al. suggest that information processing may 
be described in terms of four units in a schematic model. In the input unit, a 
stimulus may be presented to any one of the receptors, extero-, intero-, and 
proprioceptors, and within the exteroceptors to any one of the sense 
modalities. The input can be presented in a parallel (simultaneous) or serial 
(successive) manner. The stimulus is registered immediately by the second 
unit, the sensory register, and may be passed on for processing. The third 
unit, which is responsible for central processing, has three major components: 
that which processes separate information into simultaneous groups, that 
which processes discrete information into temporally organized successive 
series, and the decision-making and planning component which uses the 
information integrated by the other two components. Finally, the output unit 
determines and organizes performance in accordance with the requirements 
of the task. 


For purposes of the present study, the important feature of the 
simultaneous-successive model is that information may be represented either 
simultaneously or successively in central processing independent of the form 
of information presentation. For auditory and visual information specifically, 
the model suggests that the forms of syntheses involved in processing 
information from these two modalities are not functionally dependent upon 
the modalities themselves. Thus, auditory information presented successively 
can be organized in central processes as simultaneous or successive 
syntheses. In complement, visual information which is_ presented 
simultaneously can be processed by simultaneous or successive syntheses. 


If various conversions in information are possible in the course of 
processing as suggested by the simultaneous-successive model, and these 
conversions are made primarily according to task demands, the dominant 
dimensions of individual differences in cross-modal and intramodal 
matching tasks may have their basis in central processes. Accordingly, if it is 
demonstrated in exploratory factor analyses that cognitive processes of the 
form suggested by the simultaneous-successive model are related to 
performance in the various combinations of modality matching tasks, then 
because the model operationally defines central processes, the nonmodal 
theoretical point of view will be supported. Conversely, if simultaneous- 
successive processes are not found among the combinations of modality 
matching, but rather, cognitive processes appear to be unique to each task 
dependent upon the modalities involved, this result will be more consistent 
with the modal-specific theory of sensory systems. 

The second question of this study concerns the relationship of cross-modal 
and intramodal matching ability to general intelligence. A number of strong 
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conjectures have been made regarding modality matching tasks as indicators 
of fundamental processes in intellectual competence. These conjectures have 
referred almost exclusively to cross-modal forms of matching with the 
implication that it is the comparison of information between modalities that 
engages basic intellectual processes. Birch and Belmont (1964), for instance, 
suggest that “this type of intersensory integration in its more complex and 
general form may be one of the processes that underlies adaptive behaviour 
and thus IQ” (pp. 859-860). Much stronger conjectures are made by Jensen 
(1969), who states that the “conceptually most pure” aspects of intelligence 
are displayed in cross-modal tasks (Jensen, 1969, pp. 10-11). 

Despite these claims, there is relatively little empirical information 
available on cross-modal matching ability and intelligence (Jarman & Das, 
1977). Past research has been almost exclusively correlational in design, and 
indicates that cross-modal matching does not correlate strongly with 
psychometric intelligence (IQ), although these correlations are generally 
statistically significant due to large sample sizes, ceiling effects in 
instruments, and pooling of groups with different means. Further, the 
assumption of the importance of cross-modal forms of matching is reflected in 
a paucity of studies which have used intramodal tests for purposes of 
comparisons of the relative importance of the two types to intellectual 
development. Thus, the importance of cross-modal relative to intramodal 
processes in cognitive competence has not been tested directly. 


The present study examines the significance of both cross-modal and 
intramodal matching ability to intelligence by determining the extent to 
which each of the possible combinations of these tasks discriminates between 
three intelligence groups. As related to the first question of this study, levels 
of performance in these tasks among the intelligence groups are interpreted 
in terms of simultaneous and successive syntheses. 


METHOD 
Subjects 


A complete survey was conducted on the distribution characteristics of the 
Lorge-Thorndike IQ data available for a large grade four school population. 
Three IQ groups were formed on the basis of both verbal and performance 
scores by setting ranges for sampling. The IQ ranges were 65-85 for the low IQ 
(LIQ) group, 90-110 for the normal IQ (NIQ) group, and 115-135 for the high 
IQ (HIQ) group. Eighty subjects were randomly selected within each IQ 
range. All subjects were chosen from predominately middle-class population 
areas. School cumulative record cards were checked in order to avoid 
inclusion of children with identified learning disabilities. 


Tests 
Cross-modal and Intramodal Matching 


Four tests of auditory and visual cross-modal and intramodal matching 
were used as developed by Jarman (1977). Each of the tests employed the 
same items in order to control for’complexity of information, with variation 
among the tests dependent only on whether the stimuli were presented 
auditorily or visually. A “same-different” format was used in all four tests 
(Goodnow, 1971a, cf. 1971b). This item type requires a dichotomous decision 
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on the part of the subject: the stimulus is presented in the first modality, and 
then removed or terminated, following which a comparison is presented in the 
second modality; the subject is asked if the comparison is the “same” or 
“different” from the stimulus. A complete list of the items in the tests is given 
in Jarman (1977). 


Auditory-auditory matching. This task involves matching a stimulus 
pattern of 1,000 cycle tones with a comparison pattern of tones. All tones are 
of 0.15 second duration with variation in patterns created by short pauses of 
0.35 seconds and long pauses of 1.35 seconds. A score of one is given for each 
of the 30 items in the test. 


Auditory-visual matching. The items in this test utilize an auditory 
pattern as a stimulus as in the auditory-auditory test, but this is matched with 
a comparison visual display of dots which is shown by a slide projector. For a 
dot of 1 unit in diameter, a short gap is 0.80 units in length, and along gapis 
of 7.17 units in length. Each of the 30 items in the test is scored as one mark. 


Visual-auditory matching. This test is the converse of auditory-visual 
matching; the visual display is the stimulus portion of each item and the 
comparison section is a set of tones. 


Visual-visual matching. This test is comprised of sets of visual patterns, 
where the first pattern of each set is compared to the second pattern, with a 
score of one given to each correct response for a test total of 30. 


Simultaneous and Successive Syntheses 

The tests of simultaneous-successive syntheses were drawn from previous 
research (Das, 1973; Das, Kirby & Jarman, 1975). 

Raven’s Progressive Matrices. This is a well-known test involving the 
selection of a missing element from a set of alternatives to complete a spatial 
matrix. The matrices in the test consist of geometric patterns. The total score 
on the test is the number of items correct, with a possible maximum of 36. 


Figure Copying. The figure copying test was developed by the Gessell 
Institute (Ilg & Ames, 1964), and consists of 10 geometric figures which are 
presented consecutively to the subject for reproduction while they are in full 
view. A maximum score of two is possible for each item, with a test total of 20. 


Graham-Kendall’s Memory for Designs. This test was developed by 
Graham and Kendall (1960). A set of 15 geometric figures, each on a separate 
card, are presented for five seconds to the subject, and following the 
presentation of each card, the subject is asked to reproduce the figure. The 
total score on the test is the number of errors made, as described in detail in 
the manual, with scores for each item varying from zero for a satisfactory 
reproduction to three for a reversed or rotated reproduction. The maximum 
score which would result from consistent errors is 45. 

Serial Recall. The serial recall task consists of a set of 24 word lists, with 
four words in each list. Of these lists, 12 contain words which are 
semantically similar to one another, and the other 12 contain four unrelated 
words. The two types of word lists are randomly ordered in the test. Each list 
is presented by the use of a cassette tape recording, following which the 
subject is asked to recall the list in the order given. The total score for each list 
is the number of words in the correct position, with a possible total score of 
four per list, giving a maximum test score of 96. 
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Visual Short-term Memory. This test was developed by E. Howarth andJ. 
Brown of the University of Alberta. Each of the 20 items consists of a five- 
section grid which is presented to the subject for five seconds. Following 
presentation, a two-second filler task of colour naming is used to eliminate 
rehearsal. The stimuli are single digit numbers and the subject is asked to 
reproduce the digits on an empty grid following the filler task. Each digit in 
the correct location is scored as one mark, giving a possible score of five for 
each item and a possible test total of 100. 


Procedure 


The set of four cross-modal and intramodal matching tests were 
administered in a balanced design within each IQ group in order to 
compensate for possible transfer effects. A simple four by four Latin square 
design (Cochran & Cox, 1957) was used with 20 subjects in each IQ group 
randomly assigned to each of four orders of test administration. The tests of 
simultaneous and successive syntheses were given in random order to each 
IQ group. Subjects were tested in random order across IQ groups. 


RESULTS 
Descriptive Information 


Table 1 demonstrates that sampling from three ranges of the IQ 
distribution for grade four resulted in a higher mean chronological age (CA) 
for the subjects in the LIQ group than the mean CA for the NIQ and HIQ 
groups. The CA difference between the groups was statistically significant, 
F(2,237) = 134.63, p < 0.01. The effect of this difference on all the other 
variables was checked in subsequent statistical analyses by using CA asa 
covariate. It was found that after covarying to create a CA-match design 
(Ellis, 1969; Heal, 1970), the IQ groups still retained the same similarities and 
differences on all variables. 


TABLE 1 


MEANS AND STANDARD DEVIATION OF CHRONOLOGICAL AGE AND 
IQ FOR SAMPLE GROUPS4 


LLG NIQ HIQ 
M SD M SD M SD 
Chronological Age WY TKS) 6.53 We oO 3.86 AL Vilbsteke! 3.81 
Verbal IQ WH OS By old 100.89 4.76 124.63 4.95 
Performance IQ 76.78 6.47 OZ 6.97 23 ec 2el4 


a 
Chronological age given in months, 


Also given in Table 1 are the means and standard deviations for the verbal 
and performance IQ scores. The data show that the groups are quite 
symmetric in their composition in terms of the normal IQ curve. 


Exploratory Factor Analyses 


' In order to identify the processes involved in each of the cross-modal and 
intramodal matching tests, the complete test correlation matrix for each IQ 
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group was subjected to principal components analysis. The factors were 
rotated by both orthogonal and oblique procedures (Gorsuch, 1974). The two 
types of rotations yielded virtually identical results, and therefore only the 
orthogonal rotations by Varimax are reported here. The eigenvalue greater 
than one criterion was used for the selection of the number of factors, which 
also yielded comparable results to other procedures, including the scree test 
(Cattell, 1966). 


In the factor matrix for each group, the same set of reference factors, 
namely simultaneous and successive syntheses, was identified. The emphasis 
of the present study is on the use of these factors to interpret the processes in 
the cross-modal and intramodal matching tests, and therefore the rationale 
for labelling the simultaneous and successive factors is not given here. This 
rationale is supplied in Das (1973) and Das, Kirby and Jarman (1975). 

The factor matrix for the LIQ group is presented in Table 2. Factor I of this 
matrix has a moderate loading from Auditory-visual matching and strong 
loadings from Visual-auditory matching, Visual-visual matching, and Serial 
Recall. This factor is unique relative to the usual simultaneous and successive 
factors, and it also appears as Factor I in the analyses for the NIQ and HIQ 
groups. The factor appears to represent verbal coding of information. This 
discussion will return to the nature of the verbal coding factor following 
examination of the results for the NIQ and HIQ groups. 


TABLE 2 
PRINCIPAL COMPONENTS WITH VARIMAX ROTATION: LOW IQ GROUP #? 


Test i Til 1RiLAE 
Auditory-auditory Matching 051 ~ -031 856 
Auditory-visual Matching 407 441 217 
Visual-auditory Matching Shih -106 2Oy) 
Visual-visual Matching 792 207 —091 
Raven's Progressive Matrices -034 651 -294 
Figure Copying 067 743 134 
Memory for Designs 24 -787 -064 
Serial Recall 692 007 057 
Visual Short-term Memory 287 204 687 
Component Variance Up Sake 1.886 1.438 
% Component Variance SOmo 2 Bon02 27.46 

Eigenvalues 2.751 Hs (Aon 1.246 


* Decimals omitted from factor loadings. 


Factor II of the results for the LIQ group has its dominant test loadings 
from Raven’s Progressive Matrices, Figure Copying and Memory for Designs 
(the Memory for Designs loading is negative because the test is scored for 
errors). A moderate loading is also notable for Auditory-visual matching. 
This factor is simultaneous synthesis, and its definition by the first three 
tests is identical to previous results (Das, Kirby & Jarman, 1975). 
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Factor III of the LIQ matrix is apparently successive synthesis. This 
factor has been defined by Serial Recall and Visual Short-term Memory in 
previous research. Interestingly however, in this analysis for the LIQ group, 
Serial Recall loads on the coding factor, rather than the successive factor. The 
sequential nature of Auditory-auditory matching and Visual Short-term 
Memory defines the successive factor here. 

The factor matrix for the NIQ group is given in Table 3. Factor I of this 
matrix shows a clearer definition of the coding factor, with moderate loadings 
from Auditory-auditory matching and Auditory-visual matching. 


TABLE 3 
PRINCIPAL COMPONENTS WITH VARIMAX ROTATION: NORMAL IQ GROUP? 


Test ih Wat iE 
Auditory-auditory Matching 408 109 p23 
Auditory-visual Matching 432 547 004 
Visual-auditory Matching 850 076 237 
Visual-visual Matching 779 184 306 
Raven's Progressive Matrices 319 643 -019 
Figure Copying 110 867 194 
Memory for Designs =377 -642 —071 
Serial Recall 141 032 751 
Visual Short-term Memory 347 =021 693 
Component Variance ZN OIF) 29 oud 
% Component Variance Bolo 7 34.96 PET fa SM 

Eigenvalues 3.641 Wesaly, 1.004 


a F : : 
Decimals omitted from factor loadings. 


Factor II is simultaneous synthesis, and is quite comparable in its 
moderate loading of Auditory-visual matching to the results for the LIQ 
group. Factor III is successive synthesis, with Serial Recall and Visual Short- 
term Memory defining the factor as in previous studies (Das, Kirby & 
Jarman, 1975). Auditory-auditory matching has only a moderate loading on 
this factor for the NIQ group. 


The factor results for the HIQ group are given in Table 4. Factor I is 
comprised almost solely of the modality matching tests, indicating a verbal 
coding process common to all modality combinations. 


Factor II of the matrix for the HIQ group is a clearly defined simultaneous 
synthesis factor, and Factor III is successive synthesis. 


Collectively, these results demonstrate the expected simultaneous and 
successive factors in all groups, and also a first factor that has been 
designated as verbal coding. The verbal coding interpretation of this factor 
appears to be the most parsimonious, in light of two points of evidence. The 
first of these is the difference between two distinct strategy types in modality 
matching as discussed by Kahn and Birch (1968). The first strategy noted by 
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TABLE 4 
PRINCIPAL COMPONENTS WITH VARIMAX ROTATION: HIGH IQ GROUP@ 


Test oT ite TT 
Auditory-auditory Matching 647 249 Sul 
Auditory-visual Matching 713 107 201 
Visual-auditory Matching 834 -101 SMG 
Visual-visual Matching 714 -109 054 
Raven's Progressive Matrices 216 729 307 
Figure Copying 857/ 676 = iL(O}e) 
Memory for Designs Al -905 -076 
Serial Recall 210 019 882 
Visual Short-term Memory 158 017 HWS 
Component Variance 2.390 Le SOM Jk 6 Sieul 
4 Component Variance 40.68 By sel 26.91 

Eigenvalues 7.5 Syl dete7il 1.254 


a : - : 
Decimals omitted from factor loadings. 


Kahn and Birch is the direct recall of tone sequences; in the present study this 
was suggested by the loading of Auditory-auditory matching on the 
successive factor for the LIQ group. The second technique is numerical and 
verbal coding of information. Kahn and Birch noted that in their study 
roughly half of their subjects reported this technique as a dominant strategy. 
In the present study, all of the tests for which the stimulus portion was visual 
were performed by the same cognitive process by all of the groups. It is 
probable that visual displays encouraged the subjects in all of the groups to 
use a coding strategy because of the easier clustering and grouping that was 
possible in this type of presentation. 

The second reason for the designation of this factor as verbal coding is 
perhaps the strongest, and is based on the results for the Auditory-auditory 
task. It is evident that the numerical coding of an auditory sequence involves 
a translation to a new form of cognitive representation: e.g., beep, beep, pause, 
beep, pause, beep .. . becomes 2, 1, 1, etc. (Lehman & Goodnow, 1972). In 
contrast, recall of an auditory sequence in its original form is not a 
translation strategy. The reason that the Auditory-auditory task for the low 
IQ group loaded on the successive factor, then, may be that the subjects in 
this group attempted to recall the sequence literally. In contrast, the high IQ 
group performed the Auditory-auditory task by the same strategy as the other 
tasks: they encoded all information, auditory and visual, to a common code. 
Further, recall of this code did not become a successive task, and instead 
coding formed a separate factor, because it was the accuracy of encoding 
- (Biggs, 1969) and ability to match the code with the comparison stimulus that 
differentiated individuals, not the retrieval of the code. Thus, the verbal 
coding factor is defined by the organization of information, not the successive 
recall of the information. 

If this interpretation of the verbal coding factor is correct, then the factor 
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may be a special case of successive synthesis. The two factors, verbal coding 
and successive synthesis, may be separate in the analyses because the coding 
factor defines individual differences in terms of verbal-mediational encoding 
strategies, and the successive factor defines individual differences in terms of 
rehearsal and memory retrieval. An examination of the modality matching 
items suggests that this may be the case; once numerically coded, the 
quantity of information in most items would be well within the capacity 
limits of the subjects of the present study. Thus, individual differences in 
modality matching when a verbal coding strategy is used would be more in 
the encoding phase. In the successive synthesis tasks on the other hand, 
organized information is presented to the subjects and performance 
differences are more likely in the efficiency of rehearsal and retrieval. 
Because encoding and retrieval are likely independent operations (Brown, 
1975), two separate factors could be defined, as has been found in the present 
study. 


Mean Differences in Performance 


The means and standard deviations for each of the IQ groups on all 10 
variables are given in Table 5. For both the cross-modal and intramodal 
matching tests, and the tests of simultaneous and successive syntheses, an 
increase in performance is found generally from the LIQ group to the HIQ 
group. Scores for Memory for Designs are error scores rather than numbers 
correct, hence its inverse relationship to IQ. 

The strength of discrimination of the cross-modal and intramodal 
matching tests were assessed both among the four tests and relative to the 


simultaneous-successive tests by a multivariate analysis of variance 
(MANOVA) followed by a discriminant analysis (Tatsuoka, 1970, 1971). The 


TABLE 5 
MEANS AND STANDARD DEVIATIONS FOR SAMPLE GROUPS 


LIQ NIQ HIQ 

Test ————————— 

M SD M SD M SD 
Auditory-auditory 20.07 3.86 PASSO) 4.14 RE TPS 3.86 
Matching 
Auditory-visual OOS, 4.49 2308 sO] Def Oi Sh is 
Matching 
Visual-auditory 24.15 4.47 26.67 Bor 28.47 1.88 
Matching 
Visual-visual 2B} (Oil 4.06 Dike sh BSNS 1.96 
Matching 
Raven's Progressive DPL STC 4.95 25.96 yo Allst 31.64 Bly 
Matrices 
Figure Copying 14533 2.02 hen) © eas Joys Peesieyi 
Memory for Designs Sho Wi 3.46 Salk PeZs 1.65 PEO)S: 
Serial Recall G0 19.19 77.86 Wei Ths} WS OS: TF Hei) 
Visual Short-term DS aaG We) Sy 64.01 18.61 69.75 We Sei} 
Memory 


a ee ee ee eee eee eee ee 
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results of the three group one-way fixed effects MANOVA on the 10 variables 
indicated significant differences between the groups, F(20,456) = 10.57, 
p< 0.01. 

The vectors of weights for the discriminant functions are presented in 
Table 6. It was found that the two functions accounted for 57% of the total 
variance in the discriminant space, with X2 = 138.64, p < 0.01 for the first 
discriminant root and X? = 18.07, p < 0.07 for the second root. The results in 
Table 6 for percentage of common variance indicate that Vector 1 represents 
almost all of the common variance in group differences relative to Vector 2. 


TABLE 6 
NORMALIZED DISCRIMINANT FUNCTION WEIGHTS? 


Test Vector 1 Vector 2 
Auditory-auditory Matching 732. TSA? 
Auditory-visual Matching 701 -004 
Visual-auditory Matching 015 —O17 
Visual-visual Matching 163 474 
Raven's Progressive Matrices 674 =O18 
Figure Copying 008 -574 
Memory for Designs 167 087 
Serial Recall 042 =1143 
Visual Short-term Memory -017 015 
% Variance 91.40 8.60 
Eigenvalues 1.094 OOS 


a ‘ ? ' 
Decimals omitted from weights. 


The definition of Vector 1 in terms of tests shows that among the 
simultaneous-successive tests, Raven’s Progressive Matrices demonstrates a 
strong discrimination effect. The strongest discriminator of all of the tests, 
however, is Auditory-visual matching. The other three modality matching 
tests are negligible and quite comparable in their discrimination power 
relative to the Auditory-visual tests. 


DISCUSSION 


The first question of this study was whether modality-specific sensory 
systems or nonmodal central processes account for ability to match 
information cross-modally and intramodally. This was assessed within 
separate IQ groups in order to determine any possible variations in modal 
specificity as a function of intelligence. Within the limitations of exploratory 
factor analysis, the results suggest that the cognitive processes in modality 
matching may be defined in terms of the central process constructs of 
simultaneous and successive syntheses. The factor analyses also suggested 
that successive synthesis may be differentiated into two types. The first is the 
form defined operationally in previous research (Das, Kirby & Jarman, 1975), 
which is based predominantly on individual differences in rehearsal and 
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retrieval processes; this form retained the general designation of successive 
synthesis in this study. The second form was designated specifically as 
verbal coding. In terms of the four modality matching tests, it was found that 
Auditory-auditory matching consisted of successive processes in the LIQ 
group, a combination of successive and verbal coding processes in the NIQ 
group, and mainly verbal coding processes in the HIQ group. Auditory-visual 
matching consisted of a combination of simultaneous and verbal coding 
processes in the LIQ and NIQ groups, and verbal coding processes only in the 
HIQ group. Visual-auditory matching and Visual-visual matching consisted 
of similar verbal coding processes in all three IQ groups. 


Thus, the nonmodal view of ability in modality matching has received 
more support in this study than the modal-specific position. It does not appear 
possible to identify a pattern of processes in the factor analyses that have 
been presented that are systematically related to sensory modalities. Indeed, 
with increasing levels of intelligence, differential factor patterns among the 
four tests appear to disappear entirely, to the point where in the high IQ group 
the same cognitive process accounts for the majority of individual differences 
in all four modality matching tests. 


The second question of this study deals with the role of the matching 
paradigm as an indicator of intelligence. Many past discussions on this topic 
have referred specifically to cross-modal forms of matching with the inference 
that it is the conversion of information across modalities that measures 
cognitive ability. The present study has demonstrated that these conjectures 
have some degree of validity, but are too simplified. The Auditory-visual 
matching task was the most powerful discriminator among all of the tests 
given to the IQ groups. Its cross-modal counterpart however, Visual-auditory 
matching, demonstrated a much lower discrimination effect. In addition, this 
low effect for Visual-auditory matching was comparable to that found for the 
two intramodal tasks, Auditory-auditory matching and Visual-visual 
matching. Discrimination of the intelligence groups therefore, was not 
associated with cross-modal as opposed to intramodal processes. Rather, as 
suggested by the factor analyses, different forms of syntheses are engaged by 
various combinations of these tasks, and further, these patterns vary among 
different intelligence groups. It is more likely then that individual differences 
in strategic behaviour interacted with the different task demands of the 
modality combinations to produce different discrimination powers among the 
cross-modal and intramodal tests. Thus, modality matching tests tap 
considerable variation in strategic behaviour (cf. Jarman, in press) which has 
likely contributed to the variation in points of view on the theoretical 
significance of these tasks. 


These interpretations of the results of the present study, in turn, suggest 
two areas of further research. The first of these is concerned with the popular 
assumption of modal-specific deficits in learning-disabled children (see 
Freides, 1974). Future studies using both cross-modal and intramodal tasks 
may reveal that disabilities are based predominantly in central processes, 
possibly in simultaneous and successive syntheses or in highly similar 
constructs. Independent research using other paradigms such as dichotic 
listening also suggests this possibility (Das, Kirby & Jarman, in press). 

A second area of possible research is the use of children’s sensory 
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modalities in instructional design. One consequence of the modal-specific 
point of view has been to incorporate sensory modality variables in remedial 
and aptitude-treatment experimental programs (Bissell, White & Zivin, 1971; 
Lilly & Kelleher, 1973; Magoon & Raper, 1977; Silverston & Deichmann, 
1975). This incorporation is likely premature, and for children at the age level 
of those in the present study, it may be more important for instructional 
purposes to consider the cognitive strategies and central processes 
characteristically used in different types of tasks. At younger age levels, on 
the other hand, sensory modalities may prove to be important instructional 
variables. It may be conjectured that the role of modalities shifts with 
increasing age, such that a transition takes place from modal-specific 
processes to central processes as a function, in part, of increasing use of 
mediational and cognitive strategies. Longitudinal studies spanning the 
early years of school would contribute much in investigating this possibility. 


Notes 


1.The nomenclature commonly used to designate the ordering and type of the stimulus pattern and the comparison 
pattern is the first letter of each modality name. For example, A-V would designate an auditory stimulus followed 
by a visual comparison. 
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The Effect of Sentence-Combining Practice 
on Written Compositions and 
Reading Comprehension 


This study investigated the effect of sentence-combining practice on the 
syntactic maturity of written compositions and on reading comprehension. 
Students from two grade seven classes were randomly assigned to experimen- 
tal and control groups. The analysis was based on 13 males and 13 females in 
each group. The experimental group completed 26 sentence-combining lessons 
during the 16 weeks of study. The sentence-combining problems were based on 
O’Hare’s (1973, 1975) format with a change in combining signals from upper to 
lower case letters. Syntactic maturity of written compositions was measured by 
mean words/T-unit from a 45 T-unit sample of writing. Reading comprehen- 
sion was measured by the comprehension subtest from the Gates MacGinitie 
Reading Tests — Survey E. The experimental group made a significant gain 
(p < .05) in syntactic maturity and a significant gain in reading comprehen- 
sion (p < .001). However, these gains were not significant when compared at 
posttest with the control group. (Mr. Smart is President of the Prince George 
District Teachers’ Association; Dr. Ollila is at the Faculty of Education at the 
University of Victoria.) 


In the study of elementary education, the relationship between the 
productive aspect of our written language (composition) and the receptive 
aspect of written language (reading comprehension) is of particular interest. 

A number of studies (Evanechko, Ollila & Armstrong, 1974; Hughes, 1975; 
and Kuntz, 1975) have found a positive correlation between measures of 
syntactic maturity of students’ writing and measures of reading achievement. 
For curriculum and instruction purposes, correlations between measures are 
of little value. What is important is whether instructional procedures 
designed to improve one aspect of language arts will also have a positive 
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effect on other aspects of the language arts. Specifically, will sentence 
combining practice designed to improve the syntactic maturity of students’ 
written compositions also contribute to improved reading comprehension? 
The assumption behind this question is that sentence-combining will give the 
students practice in manipulating the syntactic system of our language. If 
students become skilled in this manipulation, they should become more 
effective in dealing with syntactic structures while writing and reading. 


Interest in the effect of sentence-combining practice on students’ writing 
has grown considerably since O’Hare’s (1973) study with 73 grade seven 
students. O’Hare based his study on Mellon’s (1969) work, replacing the 
transformational grammar terminology in the sentence-combining problems 
with actual word cues. O’Hare’s experimental students made significant 
gains on six factors of syntactic maturity including the mean number of 
words/T-unit. The experimental group was also significantly superior to the 
control group at posttest on all six factors of maturity. The experimental 
group’s posttest compositions were also judged to be significantly better in 
overall quality than the control group’s posttest compositions. It is 
unfortunate that the O’Hare study did not include pretest and posttest 
measures of reading comprehension, as noted by Stotsky (1975) in her review 
of the curricular implications of sentence-combining. 


In a study using sentence-combining with 100 grade seven students over a 
period of three months, Combs (1975) reported pretest, posttest, and delayed 
posttest results. The experimental group’s posttest compositions showed 
significantly more syntactic maturity than either their own pretest com- 
positions or the control group’s posttest compositions. The delayed posttest 
showed an erosion of the experimental group’s gains but the experimental 
group was still significantly more mature than the control group. 


Cooper (1975) commented that it has been “convincingly demonstrated 
that sentence-combining will increase both syntactic maturity and overall 
quality of writing of upper elementary and junior high school children’ (p. 
(PA 

The question remains — will sentence-combining practice have an effect 
on reading comprehension? 


Stedman (1971), in a study with grade four students, attempted to 
determine the effect of a curriculum teaching syntactic embedding on reading 
comprehension as measured by a cloze test. In his total experimental group, 
Stedman found a significant gain in cloze scores over the total control group. 
In a study involving 94 fifth, seventh, and ninth graders, Fisher (1973) used 
sentence-combining exercises based on 12 transformations followed later in 
the course by reversing the process to uncover kernel sentences. The students 
who were exposed to the sentence-combining practice scored significantly 
higher on the Stanford Paragraph Meaning Test. Hughes (1975), in a study at 
the grade seven level using sentence-combining, used cloze scores as well as 
the speed and accuracy subtest from Gates-MacGinitie Reading Test to 
measure reading gain. Hughes reported a nonsignificant trend towards 
improved reading comprehension. In the Combs (1975) study, a number of 
reading measures were taken. Despite the significantly better posttest writing 
of the experimental group, there was no effect on the scores of the 
comprehension subtest of the Gates-MacGinitie Reading Test — Survey E. 
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However, Combs did report a significant advantage for the experimental 
group over the control group on a specially constructed reading comprehen- 
sion measure. 


Stotksy (1975) suggested the hypothesis that “enhanced syntactic skills 
through writing activities leads to improved reading comprehension” (p. 61). 
Based on the evidence above there is no clear answer yet to this hypothesis. 
The study reported here is designed to shed more light on the question. This 
study with grade seven students was designed to determine the effect of 16 
weeks of sentence-combining practice on the syntactic maturity of students’ 
written compositions and on their reading comprehension. The study was a 
2x 2 factorial design, with the treatment being one factor and sex the other 
factor. Pretest and posttest measures of both dependent variables were taken. 


Sample 


All of the 58 students (30 female and 28 male) in two grade seven classes at 
Vanway Elementary School were randomly assigned to experimental and 
control groups. Of the original 58 students, three moved away during the 16- 
week period. Three girls had to be discarded from the data base to make equal 
sized cells. The analysis is therefore based on 26 students in the experimental 
group and 26 students in the control group. No attempt was made to control 
the outside language experiences of the two groups. The students from both 
groups participated in their regular language arts activities in their own 
classes. Both regular classroom teachers who worked with the subjects used 
the study of novels extensively in their language arts program. One of the 
classroom teachers had the students keep a writing journal in which they 
wrote on assigned, guided, or self-selected topics in a scheduled period each 
week. The assumption is made that the randomization procedures balanced 
these and other outside language experiences. 


Procedures 

Each group met with the investigator for two 45-minute periods per week 
for the 16 weeks of the study. Two periods were lost with each group because 
of holidays and teacher professional days. 

The experimental treatment which was called Sentence-Combining was 
modelled on O’Hare (1973, 1975). During the 30 periods of the intervention, 
students completed 26 lessons on sentence-combining. The other four periods 
were interspersed as catch-up, review, and extension lessons. The sentence- 
combining problems were presented on mimeographed sheets and were 
completed in a notebook reserved for Sentence-Combining. The O’Hare (1973, 
1975) format of presenting combining signals was used in this study, with one 
change. In Lesson No. 13, a change was made in the word signals from upper 
case to lower case letters. This eliminated the tendency of many students to 
leave the signal word capitalized in the final sentence. The following is an 
illustrative sentence-combining problem. 

Mr. Brown told John SOMETHING. 

Only he could decide SOMETHING. (that) 

He was going to settle down and do sufficient 

work to pass. (when to) 

The control group, which also met with the investigator for 30 periods 
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during the 16 weeks, was called Language Arts Growth and Improvement. 
The control group’s activities consisted of three aspects: 


(a) Recreational Reading — including the writing of short summaries and 
orally sharing these with the class. 


(b) Reading for Understanding — Junior (S.R.A., 1971) — The students 
started from their own level and completed 40 cards during the 
intervention period. 


(c) Pattern Writing Practice — examining and writing noun and verb 
phrases and then using them in sentences. 


Measurement 


Two dependent variables were measured in this study: (a) the syntactic 
maturity of the students’ written compositions and (b) the students’ reading 
comprehension. As shown by the review of the language development studies 
(O’Hare, 1973), there has been an extensive search for a measure of the 
developing ability of students to combine more and more kernel sentences. 
The mean length of T-units has been shown to be a sensitive measure of this 
developing syntactic maturity (O’Hare, 1973). In order to measure the 
syntactic maturity of a representative sample of each of the subjects’ free 
writing, the first 15 T-units from three compositions written by each student 
were analyzed. The T-unit definition used was that of one main clause plus 
any subordinate clause or nonclausal structure attached or embedded in the 
main clause. When one word was omitted from a fragment, the word was 
supplied and the fragment counted as a T-unit. The mean number of words/T- 
unit per student was calculated by dividing the number of words in the 45 T- 
unit sample by 45. 


The comprehension subtest of Gates-MacGinitie Reading Test — Survey E 
was used as the reading comprehension measure in this study. Three 
alternate forms of the test are available. Form 2 was used as the pretest 
measure and form 3 was used as the posttest. The raw scores of the 
comprehension subtest were used for analysis. 


Results 
Syntactic Maturity 


The mean words/T-unit for the total experimental group changed from 
9.77 at pretest to 10.67 at posttest, while the total control group changed from 
10.05 to 10.11. 


Table 1 summarizes the results of a two-way ANCOVA with repeated 
measures on the pretest and posttest measures of syntactic maturity for the 
experimental group. As indicated in the table, there is a significant difference 
(p <.05) between pretest and posttest mean words/T-unit. The experimental 
group did make a significant increase on the measure of syntactic maturity of 
their written compositions. 


However, Table 2 summarizes the results of two-way ANCOVA on posttest 
mean words/T-unit of both growps covarying on their pretest scores. As 
indicated in the table, there was no significant difference between the 
experimental and control group’s mean words/T-unit at posttest. However, a 
trend (p = .098) in favour of the experimental group is evident. 
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TABLE 1 


PRETEST AND POSTTEST MEAN WORDS/T-UNIT: 
EXPERIMENTAL GROUP — STATISTICAL SUMMARY 


Source df MS F p 
Between Ss 25 
Sex il 0.098 0.021 0.885 
Error 24 4.630 
Within Ss 26 
Pre-Post ill 10.661 5.928 0.023% 
Sex x Pre/Post il i2D85 Ae218 0.051 
Error 24 1.799 
Total byl 
8 [dese 5@S 
TABLE 2 


TWO-WAY ANCOVA POSTTEST MEAN WORDS/T-UNIT: 
STATISTICAL SUMMARY 


Source df MS r p 
Groups 1 6.416 Pets) 0.098 
Sex af 4.601 2.04 0.160 
Groups x Sex 1 4.243 1.88 ORIG, 
Error 47 ING 9)5) 


Reading Comprehension 

The mean comprehension score for the total experimental group changed 
from 27.77 at pretest to 33.27 at posttest. The mean comprehension score for 
the total control group changed from 28.69 to 33.15. 

Table 3 summarizes the results of two-way ANCOVA with repeated 
measures on the pretest and posttest measures of reading comprehension for 
the experimental group. As indicated in the table, there is a significant 


TABLE 3 


PRETEST AND POSTTEST COMPREHENSION SCORES: 
EXPERIMENTAL GROUP — STATISTICAL SUMMARY 


Source df MS F p 

Between Ss 25 
Sex iL 108.164 0.737 0.399 
Error 24 146.805 

Within Ss 26 
Pre-Post af B93 ee 35.989 0.000003* 
Sex x Pre/Post it 0.048 0.004 0.948 
BETO 24 10.926 

Total Sk 


a 


* p< .001 
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TABLE 4 


TWO-WAY ANCOVA POSTTEST COMPREHENSION SCORKS: 
STATISTICAL SUMMARY 


Source df MS iF jp 
Groups i 11.664 0.40 OM531 
Sex 1 D3 229 0-79 Onsn7 
Groups <* Sex 1 16.625 0. 57 0.455 
Error 47 29.5251 

TABLE 5 


PRETEST AND POSTTEST COMPREHENSION SCORES: 
CONTROL GROUP — STATISTICAL SUMMARY 


Source df MS F p 

Between Ss OS) 
Sex 1 | 24.921 ORAS 0.708 
Error 24 WIPE (OBI 

Within Ss 26 
Pre-Post IL 258.743 14.001 0.001009%* 
Sex X Pre/Post il 19.719 1.067 Onsie2 
Error 24 443.543 18.481 

Total Sul 

ofa) Ss (Oil 


difference (p < .001) between the experimental group’s pretest and posttest 
comprehension scores. The experimental group did make a significant gain 
on the measure of reading comprehension. 


However, Table 4 summarizes the results of two-way ANCOVA on posttest 
comprehension scores of both groups covarying on their pretest scores. As 
indicated in the table, there was no significant difference between the 
experimental and control groups on the posttest comprehension scores. 


The significant growth for the experimental group (Table 3) is not shown 
to be significant when compared at posttest with the control group. To 
examine possible reasons for this, the pretest and posttest comprehension 
scores for the control group were analyzed. The summarized results in Table 5 
show that the control group also made a significant (p < .01) gain on 
comprehension scores. 


Conclusions and Discussion 


The results show that the sentence-combining practice group made a 
significant increase in both the syntactic maturity of their written com- 
positions and their reading comprehension. However, neither of the increases 
is significant when compared at posttest with the control group. The present 
study has failed to achieve the degree of improvement in students’ writing 
that was achieved in the O’Hare (1973) or Combs (1975) studies. It is 
important to note that the O’Hare study lasted from October to May while the 
present study ran only from mid-January to mid-May. The effect of sentence- 
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combining practice over a longer time may lead to significant differences 
between the groups. 


The improvement of the sentence-combining practice group in reading 
comprehension was significant. The fact that the control group also made 
significant improvement in reading comprehension was not unexpected. The 
sentence-combining practice was only one aspect of a total language arts 
curriculum. Hopefully, many aspects of that curriculum contribute to 
improved reading comprehension. The improvement by the control group 
does not diminish the fact that sentence-combining may contribute to 
improved reading comprehension. The change in the sentence-combining 
signals from upper case to lower case seems to have facilitated the students’ 
ability to produce well formed final sentences in the sentence-combining 
problems. 


The present study is limited by the size of the sample, the length of the 
study, and the fact that both groups were instructed by one teacher. Further 
research on the effects of sentence-combining practice on syntactic maturity 
of written compositions and on reading comprehension is necessary before 
sentence-combining is implemented on a large scale. Moreover, research with 
larger samples of students and other teachers, conducted over a longer period 
corresponding more closely to O’Hare’s (1973) study, should be conducted. 
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The Relationship Between Teachers’ 
Experimental and Dogmatic Attitudes, 
and their Pupils’ Self-Concept 


The purpose of this study was to examine teachers’ effect on fourth-, fifth-, and 
sixth-grade pupils’ self-concept when they were taught by two extreme, 
significantly different groups of teachers: one group as the most experimental 
or the least dogmatic, and the other as the least experimental or the most 
dogmatic. The result of the study was that pupils who were taught by the most 
experimental group of teachers had significantly higher self-concept than 
pupils who were taught by the least experimental group of teachers. 
Implications were drawn from the study. (Dr. Cheong is in the Education 
Department at Mount Saint Vincent University, Halifax, and Mr. Wadden is 
Principal of Lake View School, Halifax County.) 


In teacher effectiveness research, Mitzel (1960) suggested that three 
criteria be used: (a) presage variables (teacher personality, knowledge, status 
characteristics); (b) process variables (teacher behaviours, pupil behaviours, 
and teacher-pupil interactions); and (c) product variables (primarily measures 
of pupil change). Following Mitzel’s theoretical model, McDonald and Elias 
(1976) investigated teacher effectiveness, and by means of stepwise 
regressions and path analyses, they found significant relations in teacher 
effectiveness and pupils’ achievement in reading and mathematics. Also 
using Mitzel’s model as the basis, Cantrell, Stenner, and Katzenmeyer (1977) 
combined all three criteria simultaneously, and their finding was that first- 
grade teachers characterized by high knowledge of behavioural principles 
were more verbally positive with their classes and produced significantly 
higher residual achievement gain results for low and middle IQ pupils than 
did first-grade teachers characterized by low knowledge of behavioural 
principles. 
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Rodin and Rodin (1972) researched teacher effectiveness and found that 
students rated most highly those faculty members from whom they learned 
the least. Cunningham’s (1975) concern was the compatibility between 
teachers’ teaching styles and learners’ learning styles by using pupil 
achievement as the criterion; his findings were that certain types of teachers 
were significantly more effective with one type of student than with another, 
and that all student types seemed to have at least one type of teacher with 
whom they achieved significantly greater success. Apart from finding linear 
relationships between teacher behaviour and pupil performance, Clark, 
Snow, and Shavelson (1976), and Soar and Soar (1977) found in a few cases 
curvilinear relationships between teacher behaviour and pupil performance. 


Costin, Greenough, and Menges (1971) reviewed numerous studies on the 
relationship between students’ achievement and teacher behaviour, but the 
results were highly varied. Some investigations found positive correlations, 
others reported negative correlations, and still others found no significant 
relationships. A recent study by Armento (1977) is a typical example of 
generalisations summarized by Costin and his associates. 


Using Protocol Modules dealing with sensitising teachers’ awareness of 
pupil self-concept as the treatment, Borg (1977) found no significant 
improvement in pupil self-concept. However, this study, basically in 
agreement with Mitzel’s (1960) theoretical framework, was concerned with 
the impact made by certain types of teacher personality attributes upon the 
acquisition of self-concept by pupils. 

The kind of teacher personality attributes involved here is the experimen- 
tal attitude expounded by John Dewey and later schematized by Brown 
(1968). Two implicit assumptions of the study which should be pointed out are 
that experimental attitude is a desirable trait for teachers to possess, and that 
teachers are the central figure in a formal teaching-learning situation. 


The purpose of this study was to investigate the impact of the most 
experimental or the least dogmatic teachers and that of the least experimen- 
tal or the most dogmatic teachers upon their pupils’ acquisition of self-concept 
after an exposure of treatment for one semester. Therefore the hypothesis of 
the study was that grade four, five, and six pupils taught by teachers more 
experimental in attitude achieve significantly higher scores in self-concept 
than pupils taught by teachers less experimental in attitude. 


To measure pupils’ self-concept, Piers-Harris Children’s Self-Concept 
Scale was used, which consists of 80 items. The authors (1964) reported split- 
half reliability coefficients ranging from .87 to .90, and KR 21 coefficients 
ranging from .78 to .93 for this measure. This instrument was developed to 
measure broad, general self-concept dealing with physical, social, academic, 
and emotional dimensions. Self-concept is derived from the child’s own 
perceptions of himself (herself), the child’s perceptions of how his (her) peers 
perceived him (her), the child’s perceptions of how his (her) family perceived 
him (her), and the child’s perceptions of how his (her) teacher perceived him 
(her). 


Method 


A city, a large town, a small town, and a rural area were involved in the 
study, giving a reasonably good indication of the cross-section for the 
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Province of Nova Scotia. However, it should not be misconstrued to mean that 
the sample was random and representative of population of the entire 
province. 


By flipping a coin, Form A rather than Form B of Personal Beliefs 
Inventory and of Teacher Practices Inventory of the Experimentalism Scale, 
consisting of forty items each (Brown, 1968) was chosen for the study. And on 
the basis of Rokeach’s (1956, 1960) results of an item analysis, the best forty 
items of the Dogmatism Scale were used in the study. The equivalent-forms 
and test retest reliability coefficients of Personal Beliefs Inventory of the 
Experimentalism (X) Scale range from .55 to .78, whereas those of Teacher 
Practices Inventory of the X Scale range from .56 to .94. The Dogmatism (D) 
Scale possesses internal consistency and test retest reliability coefficients 
ranging from .58 to .93. Therefore the reliability coefficients for both 
instruments were from moderate to high. 


Before the 1972-73 academic year began, the Experimentalism Scale and 
the Dogmatism Scale were administered by the investigators to all grade four, 
five, and six teachers, except specialists in such subjects as Music, Art, 
French, and Physical Education, in their respective locations in early 
September of 1972. In total, 168 teachers participated. 


One point that needs to be made explicit is that it was the experimental 
design of the study to exclude all specialist teachers because it was felt that 
they did not spend sufficient hours with pupils and since the study was 
designed to measure regular classroom teachers’ impact on pupils. Schools 
which implemented subject teaching (one teacher teaching only one subject to 
upper elementary grades) were also excluded, again because it was felt that 
they did not spend sufficient hours with pupils. Teachers who had taught the 
same classes the previous year were excluded, since the study was designed to 
measure teachers’ effect upon pupils for a specific period of time. 

As soon as teachers’ scores on the X Scale and the D Scale were available, 
eleven teachers with the highest scores on the X Scale and the lowest scores 
on the D Scale were chosen to constitute the high group of the study, and 
eleven teachers with the lowest scores on the X Scale and the highest scores 
on the D Scale were selected to constitute the Jow group. The reasons for 
truncating eleven pairs of teachers from the score continuum were: (a) the fact 
that there would be more than 500 pupils in the study, an adequate number 
for statistical purposes, and (b) the study was not funded by any agency. A t 
test (Guildford & Fruchter, 1973) was then performed; these two groups of 
teachers were significantly different on the X Scale (¢ = 7.80, df= 20, p < .01) 
and the D Scale (¢ = 13.96, df = 20, p < .01). 

An informal examination was made of the twenty-two teachers chosen for 
the study with respect to their sex, age, and the school systems where they 
taught. It revealed that there was representation of male and female teachers, 
old and young teachers, and teachers teaching in the city system as well as 
those teaching in rural schools. 

In mid-September of 1972, the Piers-Harris Children’s Self-Concept Scale, 
“The Way I feel about Myself’ (80 items) was administered by the 
investigators to twenty-two classes as the pre-test. In mid-February of 1973, 
the same scale was given to the same twenty-two classes (N = 544, with 
complete data) as the post-test. 
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Results 
Data were analyzed by means of analysis of covariance (Nie, Hull, 
Jenkins, Steinbrenner & Bent, 1975) with the pupils’ pretest scores on the 
Piers-Harris Children’s Self-Concept Scale as the covariate. The results are 
presented in Table 1. 


TABLE 1 


ANALYSIS OF COVARIANCE ON FOURTH-SIXTH GRADE PUPILS’ PRE- AND 
POSTTEST SCORES ON THE PIERS-HARRIS CHILDREN’S 
SCALE BETWEEN THE HIGH AND LOW GROUPS 


Source of 


Wart ation Sum of Squares df Mean Square F 
Covariates 
Pretest Scores 586.766 1 586.766 3.974% 
Main Effects 4433.508 iL 4433.508 30.028%* 
Residual 160194.941 1085 147.645 
byes <S h0)S) 
kk pp < .001 


The F ratio was significant at .001 level. This finding indicates that pupils 
taught by the most experimental or the least dogmatic teachers had a 
significantly higher self-concept at the end of one semester than pupils taught 
by the least experimental or the most dogmatic teachers. Therefore the 
hypothesis was accepted. 


Brown (1968) validated the X Scale against the criterion measure D Scale. 
As mentioned earlier, the X Scale comprises Personal Beliefs Inventory- 
Forms A and B, and Teacher Practices Inventory — Forms A and B. The 
concurrent validity coefficients ranged from .26 to .36. It is true that these 
validity coefficients, in spite of their being significant beyond .01 level, are 
quite moderate in magnitude. But it is generally recognized that substantial 
validity coefficients are difficult to obtain. 


A second point related to the use of the pretest scores as the covariates, 
which requires some clarification, is that the pretest scores were inserted into 
the design to remove extraneous variation from the posttest scores, thereby 
increasing measurement precision. The finding indicated that a significant 
amount of variance in the posttest scores is attributable to the pretest scores. 
However, the computer program was designed in such a way that the 
analysis of variance was performed on the “corrected” scores, that is, 
“corrected” posttest scores which were our primary concern in terms of 
measuring the impact made by teachers in the high group and those in the 
low group upon their respective pupils’ self-concept over a period of one 
semester. 


Discussion 
It might well be argued that children’s self-concept is positively correlated 
with their school achievement. On the basis of the findings of this study, it 
can be safely concluded that the types of teachers, specifically in terms of 
their personality or to use Mitzel’s scheme (1960), the relation between the 
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presage criteria (teachers’ cognitive preparation and affective traits), the 
process criteria (the instructional process), and the product criteria (pupils’ 
acquisition of cognitive skills and affective attitudes), had significant impact 
upon pupils’ self-concept which is positively correlated with their achieve- 
ment in school. It appears that it would be sensible to pay attention to the 
affective aspect in teacher preparation. Nevertheless, it is realized that there 
is greater difficulty in measuring validly and reliably the affective traits than 
the cognitive skills. Perhaps this is why, in teacher education heretofore, close 
attention seems to have been given to the cognitive aspects such as GPA 
and/or scores on the National Teacher Examination. It is now apparent that 
educators can no longer afford to ignore or to pay lip service to the affective 
dimension of teacher education. Since children’s self-concept is a crucial 
determinant for their achievement in school, and the types of teacher 
personality do make a difference, as much attention as possible should be 
given to the affective dimension of teacher education programs as well as of 
teacher candidates. 
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NEW CAREER OPTIONS FOR WOMEN: A COUNSELOR’S 
SOURCEBOOK. By Helen S. Farmer and Thomas E. Backer. New York: 
Human Sciences Press, 1977, 349 pp., $16.95. 


Educationalists have been aware, for some time now, of the way in which 
unquestioned cultural assumptions about sex-linked aptitudes and abilities 
have damaging effects upon the career choices made by, and the options 
made available to women as opposed to men. The bias that results from 
stereotypic assumptions affects both sexes, but in both personal terms and in 
terms of the issue of option and choice, greater limitations are placed upon 
women than upon men. A particularly disturbing observation is that those 
whose special task it is to advise people about their future labour market 
participation are often the worst offenders in the preservation and 
dissemination of outdated information and stereotypic, but unproven, 
assumptions about their clients’ abilities based on their sex and thus, the 
labour market “niche” to which they might best be suited. Consequently, any 
material that might help change the status quo in both counselor attitudes 
and beliefs, and counseling techniques, is to be welcomed. 


Farmer and Backer have produced a comprehensive review of women’s 
position in the work world of the United States, but more importantly have 
produced a book that attempts to chart the “how to...” process in terms of the 
often special needs of women in their relation to the work world, and in terms 
of the counselor’s own approach to the issues. The underlying assumption 
made by the authors is that all persons can achieve greater self-determination 
through the realization of the attitudes, behaviours and the social environ- 
ment effects that may limit “optimal development and freedom to choose.” In 
the pursuit of such an end the counseling process takes on a critical 
dimension irrespective of sex, but at the present time, most critically in terms 
of the female sex. 


The authors initially undertake a review of the current position of women 
in the labour force in the United States, with some international comparisons 
and an examination and contradiction of some of the myths regarding 
“women’s work.” The presentation, in an admirably succinct manner, of 
current findings from sociological and psychological research is coordinated 
with references to inventory sources and testing programmes. This informa- 
tion comprises the first half of the book. 
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A major problem, however, is apparent in the presentation of the 
information mentioned above. Observing the concern of the authors, and of 
some practitioners in the field regarding existing counselor attitudes and 
bias, the presentation of information is remote for any discussion of the ways 
in which counselors might alter their attitudes and procedures. Such a 
criticism takes into account the problems any author confronts in organizing 
the material and information to be covered. In a book designed as a 
pragmatic aid to counselors, by-passing the opportunity to indicate the 
significance of the theories and facts presented for the day-to-day realities of 
the counseling situation and the concomitant problems faced by the client is a 
grave oversight. For example, there is a short section on child care, and some 
scattered references elsewhere to the issue of the working mother, but in the 
section devoted to the question of child care the authors conclude with the 
observation that, at present, “the available resources meet only 18 per cent of 
the need.” Obviously any advice given to the working mother has to be given 
with this reality in mind, but the issue is not addressed by the authors in their 
subsequent discussion of the special needs of the working mother. For this 
reason, duplicated with regard to other particular concerns, needs and 
problems of the working women, it is probable that the book will be used by 
the majority of counselors more as a reference guide for programmes and 
local, professional, or union interest contacts, than for its initial presentation 
of the problems and needs of working women. Alternatively, others who 
relate theory and practice are likely to already comprise the “converted.” A 
great pity, if such is the case, particularly in terms of the aforementioned 
concern with the documented prevalence of counselor bias with regard to 
“women’s work.” 


In the presentation of the resources and agencies available to the 
counselor and client, Farmer and Backer have produced a well-organized 
catalogue for the U.S. In the sections dealing with specific counseling 
techniques and resources an array of interesting and, in some cases, 
innovative, ideas is outlined. But the discussion is guided by the assumption 
that the counselor has almost unlimited access to funds, and awesome power 
to effect change in a multitude of institutional settings. Such optimism on the 
part of the authors is both understandable and excusable given the 
assumptions underlying their efforts, but this does not alter the fact that such 
optimism is frequently misplaced in the reality of the counseling process. 
Perhaps in the pragmatic interest of the day-to-day counseling effort, some 
Nader-like investigation is required indicating the ways in which problems 
such as inadequate funding or even access to clients are frequent and might 
be overcome. 

Assessing the book for Canadian education lists and counselors, it is 
estimated that about half of the book — that devoted to an assessment of 
current research findings and an examination of “special needs” — will be of 
direct interest. The remaining half of the book contains specific information 
of limited applicability in Canada, such as the text and discussion of some of 
the issues raised about the Equal Rights Amendment. In this context, one 
may conclude that a similar source book with more concern to the 
presentation of its material is long overdue for Canada. 


R.A. Sydie 
Department of Sociology 
The University of Alberta 
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ADULT EDUCATION AS A FIELD OF STUDY AND PRACTICE. By 
Duncan D. Campbell. Vancouver: University of British Columbia Centre for 
Continuing Education. 1977, 230 pp., $5.00. 


This book is a report of a study Mr. Campbell has recently completed on 
preparatory training for those involved in adult education. A review of such a 
study should examine its purposes, describe its procedures, and react to its 
outcomes. 


This question emerges as one reads the report: Did the author start with a 
conceptualized program for the preparation of adult educators which he 
wished to test through the process of critical enquiry or was he searching for a 
theoretical framework within which to construct a model program? While the 
answer to this question is not clear and perhaps unimportant, one suspects 
that the study is intended to verify preconceived opinion. 


The procedures followed in the study involved an extensive survey of the 
literature on adult education. The author has listed, under references cited, 
some hundred and seventy titles including books, journal articles, govern- 
ment documents and unpublished theses and papers. From these he has 
drawn a book-length series of direct quotations, paraphrasings and 
descriptions. He has organized this series around such themes as the product 
to be achieved through training, the processes and practices of training, and 
preferences which lead to the concluding recommendations of the report. The 
study is an excellent illustration of reflective enquiry. The challenge in this 
type of research is to achieve objectivity in selecting representative readings 
from the universe of existing literature on the topic and to analyze 
insightfully the opinions and practices expressed in those readings. On both 
counts Mr. Campbell has done a workmanlike job. 


The author commences his study with a brief but adequate survey of adult 
education in Canada, an appropriate beachhead from which to attack the 
existing chaos of the field. He deals immediately with the diversity of 
definitions currently in use, each definition revealing a different perception of 
the present and future role of adult education. He does so by subsuming all of 
these definitions under the rubric of adult education thereby disposing of the 
need to examine the revolutionary implications of such descriptive terms as 
lifelong learning, nontraditional studies and community education. 
Otherwise he might have been forced to undertake three studies and write 
three books instead of one! 


Mr. Campbell dismisses the European design in the training of adult 
educators which has been strongly influenced by the perception of adult 
education as a social instrument for improving the lot of powerless people. He 
chooses rather to view preparation as a “deepening of knowledge grounded in 
theory connected to and illumined by experience.” This view leads inevitably 
to familiar patterns. His proposed training program is set within the contexts 
of psychology, sociology, philosophy and history with overtones of 
methodology, organization and administration. 

Throughout the report the author stresses the need for research in the field 
of adult education. This he uses as his major argument for insisting that 
training programs for adult educators be located within universities. The 
conclusion is doubtless valid but hardly original. The same argument was 
used to justify the transfer of teacher education from the normal school to the 
university over thirty years ago. There were many more compelling reasons 
for this transfer historically than a pious recognition of the need for research. 


While current writers in adult education are treading well worn paths, 
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these do not apparently lead to the same destination. A most interesting 
phenomenon appearing in the literature is the move to dissociate the 
preparation of teachers for adults from that of teachers for public and high 
school youth. Some universities in both North America and Europe are 
presenting what purports to be a new discipline, known as andragogy. This 
discipline stresses an instructional methodology designed for adult students, 
one which cannot become functional apparently in close association with 
those who are dedicated to the principles of pedagogy. 


While Mr. Campbell refers only briefly to the discipline of andragogy he 
apparently agrees with its distinctions. His conclusion is that training for 
adult educators should not be set within regular faculties of education. To 
support this he describes practices followed in England where training in 
methodology is closely related to the subject of instruction. American practice 
is to locate training programs for adult educators within faculties of 
education. Nonetheless, some American writers cited by the author are 
opposed to this, insisting that it works to the disadvantage of adult educators- 
in-training. Coolie Verner, former head of the Centre for Continuing 
Education at the University of British Columbia, an independent unit, 
vigorously defends the practice in existence there. Mr. Campbell cites this 
defence in support of his recommendation. 


It is difficult to establish rationally based evidence as justification for 
those positions or practices. The evidence presented in the report is loaded 
with value judgments. The methodology advocated by ‘“‘andragogues” does 
not differ appreciably from that advocated by many teacher educators. For 
instance, the concept of teacher as a facilitator of learning rather than a 
director of instruction is not unknown to those who prepare teachers for the 
elementary and high school grades. 


As an alternative, Mr. Campbell proposes that training programs for 
adults be placed with community development. This proposal has merit if one 
were to define adult education as a facet of community education, one phase 
only, albeit an important one, of community improvement. That adult 
education should become the handmaiden of community development does 
not appear among the assumptions of this report. 


Some English writers identify the future role of adult education as 
encompassing recurrent periods of learning over the lifetime of an individual. 
If this should happen these writers forecast a paradigm shift in education. We 
now expect that formal education will terminate for most people well before 
the age of twenty-five. If each individual is guaranteed an entitlement of 
sixteen to eighteen years of formal schooling with the option of taking the 
post compulsory years according to personal preference we are presented with 
a completely different view of schooling arrangements. The conventional 
education-work-leisure-retirement sequence would be dramatically altered. 
Any move in this direction would further blur distinctions between 
preparatory and continuing education. 

The study contains a compendium of knowledge about adult education. 
Many of Mr. Campbell’s analytical interpretations of this wealth of 
information are exceedingly germane. He has made a useful contribution to 
adult education as a field of study. 


T. C. Byrne, Visiting Professor 
Department of Educational Administration 
The University of Alberta 
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PEOPLE, CULTURE AND LEARNING. By John W. Friesen. Calgary: 
Detselig Enterprises Ltd., 1977, 255 pp., $12.25. 


In People, Culture and Learning, the author, John W. Friesen, has set an 
ambitious goal of combining theory with practice within the area of 
intercultural education with the focus being on the “ phenomenon of culture.” 
If this book is judged strictly in terms of this goal, it will be seen as a less 
significant contribution than if it is more modestly viewed as an attempt to 
provide an awareness of and insight into the cultural distinctiveness of three 
ethnic groups in western Canada. 


Friesen has broken the book into two parts. Part One, consisting of five 
chapters, is an attempt to provide a theoretical framework for an analysis of 
intercultural education. Part Two, including seven chapters, consists of case 
studies of the Plains Indians, Hutterites and Mennonites. A selected 
bibliography, organized according to topics, has been provided which will be 
useful to a student unfamiliar with the area of intercultural education. 


That the book has been published by an unknown local enterprise may be 
due in part to the fact that a major proportion of the material is neither totally 
new nor unavailable. Of the twelve chapters, eight are essentially articles 
which have appeared between 1971 and 1976 in readily available journals. 


As was indicated earlier, Friesen has failed mostly in dealing with the 
theoretical components of intercultural education. What he presents as the 
sociological base constitutes little more than an affective response to the 
importance of intercultural education. One may well agree with the 
humanistic conclusions about majority-minority relations that Friesen 
reaches, but the reasons for these conclusions are based more on a messianic 
commitment than a sociological analysis. Like so many other authors in this 
field, Friesen has relied on an orientation which is practical but, 
characteristically, atheoretical. Of the sociological and anthropological 
works cited, few represent contributions to the field, most citations being at 
least ten years old. Any reader approaching the book with a background in 
any of the social sciences will be less than satisfied with the theoretical 
conceptualization and the application of the “phenomenon of culture.” 
Although Friesen has discussed dominant and variant value orientations in 
Chapter Five, he has made no attempt to provide an analytic discussion of the 
dynamics of culture. Furthermore, clarity in the use of concepts is lacking. 
For example, intercultural, multicultural and pluralistic are used in- 
terchangeably. 


A final criticism of Part One of the volume is that it has failed to view 
majority-minority relations in a pluralist society within an arena of 
differentially distributed power. Chapter Two, concerning legal and 
theoretical considerations, which could have been the basis of a discussion of 
power, failed somewhat due to its descriptive and nonanalytic nature. 


On a more positive note, Chapters Three, Four, and Five provide an 
interesting descriptive account of the development and growth of inter- 
cultural education and the role of values in intercultural education. However, 
these do not contribute to the theoretical component of the book as promised 
by the author and would have been more reasonably part of a general 
introduction to the volume. 

Part Two is the book’s saving grace. Here Friesen has discussed at length 
three religio-ethnic groups, the Plains Indians, the Hutterites and the 
Mennonites. Evaluating the treatment of the three groups according to the 
author’s objectives, one must reach the conclusion that the most successful 
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case study is that of the Plains Indians. The strength of this analysis is in the 
fact that Friesen has discussed the Indians’ characteristics extensively and, 
equally important, has looked at their needs, wants, and aspirations — a very 
important part of an understanding of cultural dynamics in a pluralistic 
society. It is this latter characteristic which is lacking in the treatment of the 
Mennonites and Hutterites. One learns who they are and how the majority 
groups in society view them but one gains little understanding of how they 
appraise their world in terms of the power dimension, with all its political and 
economic manifestations. Knowing this, one could more realistically 
approach the philosophical questions Friesen has raised in Part One of the 
volume. Taken together, the discussion of the three groups is informative and 
compassionate. The references for further reading in these areas are well 
chosen and will provide the reader with a readily available source of 
supplemental readings. 


In conclusion, Friesen has presented a study of three religio-ethnic groups 
which has been done with concern and compassion. For the beginning 
teacher who must be sensitized to cultural differences between pupils who are 
likely to be members of different ethnic groups, People, Culture and Learning 
represents a readable account of some such differences. However, the serious 
student of intercultural education will find this volume lacking in that it 
presents no clear theoretical orientation which can be used in focussing on 
culture or on relations of groups within a pluralist society. Unfortunately, this 
book is likely, at best, to encourage toleration of differences between minority 
groups in the name of humanism but will provide little understanding of the 
dynamics of minority-majority group relations. Surely it is the latter that is 
needed in understanding the problems of and possibilities for Canadian 
society. 


John R. Young 
Associate Professor 


Department of Educational Foundations 
The University of Alberta 


COGNITION, CURRICULUM, AND COMPREHENSION. By John T. 
Guthrie (Ed.). Newark, Delaware: International Reading Association, 1977, 
300 pp., $8.50. 


There are a number of difficulties in reviewing this book: (1) The articles 
contained in the book are constrained by the format of the conference where 
these papers were first presented. Therefore it might be more logical to review 
the conference than to review the resulting manuscript. (2) Rather than 
expressing unity of thought, the book contains ten independent papers, most 
of which are followed by a reaction paper. The final paper attempts to be a 
review of all the papers and their reactions. 

Since evaluation only makes sense in terms of objectives, it is necessary to 
consider the objectives of this book — in actual fact, the consideration of the 
conference objectives. The conference known as “The Development of 
Reading Comprehension,” was held in Newark in the summer of 1975. The 
Preface states that “‘... the conference was intended to explore basic research 
and the teaching of comprehension” (p. xiii). “Basic research” was not 
defined. An analysis of the papers, however, indicates that whereas some 
papers were based completely around a research study in which the author 
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was involved (Wilder, p. 257), other papers did not include any research by the 
author (Athey, p. 71). One may also question the purpose of the reactions of 
one author to the paper of another author. There does not appear to have been 
any guide-lines as to what constitutes a reaction. The reactions seem to range 
from a paraphrase of the paper with additional support, and alternate 
suggestions (Hansen, p. 65) to adulation (Carroll, p. 157) to an apparent 
attempt to insert one’s own point of view (Perfetti, p. 279). 


The Foreword is very difficult to understand and is rather pretentious and 
nondirective. “This volume represents a confluence of significant current 
trends in the scientific study of reading processes and practical aspects of 
curriculum design”’ (p. ix). “Highlighted in this book is the concern with the 
advanced competencies of reading and verbal literacy subsumed under the 
general name of comprehension” (p. x). “. . . the ineffectiveness of many 
curricular reforms... (is) based upon ignorance of the nature of the cognitive 
activities involved in acquiring knowledge from text and reasoning with 
written language... This point of view is the theme of this volume’ (p. xi). 


The individual papers range over a number of topics of interest to the 
authors and depending on the reader’s background, will provide varying 
degrees of information. The recency of some of the research quoted may be 
questioned and recent parameters of discourse analysis (propositions, 
topic/comment, new/old information, staging) are lacking. 


Carroll’s (p. 1) is a general paper and he advises assessing independently 
the influence of reading, language, and cognition by means of aw- o-cscale, 
where w - 0 - c stands for written - oral - cognitive. However, he does not 
adequately distinguish these three components except that the last two are 
involved in the first. Perfetti (p. 20) does not present a conceptual definition of 
reading and at one point (p. 24) equates reading with phonological 
processing. His identification of Levels I, I, III do help the reader focus on 
certain components to be considered when children are reading. The choice of 
words in this article makes it difficult to read at times. For example, “Since 
time is continuous are we dealing with degrees of conscious effort or with 
some fairly sharp distinction between automatic and effortful” (p. 24) and 
“On the other hand it is adaptive to forget the words after a certain point” (p. 
33). Hansen (p. 65) provides a good review to the Frase (p. 42) paper which 
states “that learning outcomes are strongly related to the properties of the 
goals given to readers and their own focus of attention” (p. 65). The Athey (p. 
71) paper attempts to analyse the relation of syntax and semantics to reading. 
The majority of articles are devoted to syntactic studies, most of which are 
older and well-known. In reviewing these studies, she does not make an effort 
to distinguish between production and reception, oral and written, but tends 
to describe the studies in chronological order. Although she makes the 
statement that “syntactic development does not consist of a series of 
structures which evolve in a programmed sequence unrelated to the 
development of meanings” (p. 78), she does not criticize syntactic research 
from this point of view. And whereas later in the paper she talks about 
semantic constraints, she does not apply this to her discussion of the 
Chomsky study. Whereas children may have difficulty in understanding such 
structures as “The doll is easy to see,” they may have no difficulty with “The 
chair is easy to see.” Hood (p. 101) presents a good paper with much 
information, some based on her own research. However, she does not attempt 
to fit language development into a comprehension framework. Beck (p. 113) 
provides a good evaluation of the Sullivan Reading Series and demonstrates 
how some of its weaknesses may be overcome by examining and specifying 
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basic concepts of the reading process and curriculum design. The construc- 
tion of the New Primary Grade Reading System (NRS) would have had 
greater transferability for other designers of curriculum had she been more 
explicit on the relationship of the components of the program to the sequence 
of steps indicated within the reading process model. Hansen and Lovitt (p. 
160) discuss applied behaviour analysis methodology as a new approach for 
conducting research in reading. Some components (e.g., direct measurement) 
have long been a part of criterion reference testing. Unlike Beck, the authors 
of this paper do not have any theory of reading to guide their research 
interpretations. Rate appears to be a major criterion of successful reading. 
The results of some of their studies (regardless of the methodology) appear to 
be “givens.” For example, ‘The ability to comprehend is related to grade 
level” (p. 167) and “When difficulty of the material increases the pupil’s 
ability to read and comprehend gradually worsens” (p. 171). Otto (p. 193) 
argues for a skills approach in reading instruction. Whereas he makes such 
statements as “Once we concern ourselves with sensibly chosen skills” (p. 
196) and “ ... sensibly selected skills” (p. 198), he does not indicate the 
research on which these skills were chosen and arranged hierarchically. 
Stauffer (p. 242) sets forth his views on reading as a cognitive act but these 
are based on his 1969 book, Directing Reading Maturity as a Cognitive 
Process. Wilder (p. 257) presents an information paper on the United States 
Office of Education’s commissioning Educational Testing Service in 1971, “to 
plan and conduct a ‘descriptive and analytic study of compensatory reading 
programs in U.S. public schools’” (p. 257). Conclusions must be considered in 
the light of the limitations of research design and these are adequately raised 
by Perfetti in his review of this paper. However, either the word “exemplary” 
is ambiguous or there is a contradiction in the conclusions, for Wilder states 
“The five schools were chosen because their reading programs were 
‘exemplary’”’, but “In short, nowhere among the five schools was there 
apparent to us a definable concentrated set of principles or procedures related 
to instruction in comprehension” (p. 277). 

The final paper by Guthrie which refers to some of the ideas presented in 
the volume and to other existing research and draws implications for 
instruction, would have provided a much better conclusion for the book had it 
synthesized the key ideas presented by the text contributors, and focussed on 
the broader issue which is raised in the Preface — the interrelation between 
research and curriculum design (as opposed to instruction). 


No rationale is provide for the sequence of papers which may be a factor in 
comprehensibility. One rationale might be to begin with language develop- 
ment and then focus on reading as language and develop a concept of reading 
comprehension. This might be followed by research studies on particular 
aspects of reading comprehension, to be followed by the implications of this 
knowledge for curriculum design. With this rationale the following sequence 
is recommended. 

(1) Hood (p. 101) on language development. 

(2) Carroll (p. 1) and Stauffer (p. 242) on defining comprehension. 

(3) Athey (p. 71) on linguistic parameters of reading. 

(4) Frase (p. 42), Hansen and Lovitt (p. 160) and Perfetti (p. 20) on their research in 

reading. Perfetti is last because he moves into decoding (pronunciation). 

(5) Beck (p. 113), Otto (p. 193), and Wilder (p. 257) on curriculum design. Beck 

follows Perfetti because the reading curriculum she describes originated as a 
code emphasis reading program. 
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Perhaps the greatest difficulty in this book serving its purpose results from 
the lack of clarity between the conference (oral language) where the papers 
were first presented, and the text (written language) where the ideas are more 
widely disseminated. With regard to purposes of conferences, Carroll states “ . 
.. to get participants to rethink their objectives and methods; to get them to 
learn new terminology and concepts; review the latest research findings... 
and go away refreshed with the thought that, on arrival back home, they can 
lick all the problems that previously seemed insoluble” (p. 159). Guthrie 
makes the point that “Communication between researchers and practitioners 
may be enhanced by posing three questions to each group: 1) Do you have 
messages for your counterparts in reading? 2) Are these messages consistent? 
3) Are the messages important, that is, worthy as a basis for change?” (p. 
283). Had Carroll’s guidelines been given to the conference participants and 
Guthrie’s to the participants as manuscript writers, the book would have 
fulfilled a more definite purpose. 


William T. Fagan 
Department of Elementary Education 
The University of Alberta 
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Post-Secondary Aspirations of High School 
Seniors in Different School Contexts: 
A Canadian Study 


Using questionnaire data from grade twelve students in the Regional 
Municipality of Durham (Durham Region), Ontario, Canada, this study 
examines school contextual effects on levels of post-secondary aspirations. A 
linear regression model sorts out simultaneous effects of the independent 
variable, school environment, on the dependent variables, levels of 
educational and occupational aspiration, with relevant control factors held 
constant. Results show that, even though school environmental effects are 
trivial and unreliable, the variables parental expectations, family 
socioeconomic status, and school peer group do make _ substantial 
contributions to the variance in both dependent variables. (Dr. O’Neill is 
Assistant Professor of Education in the College of Education at Brock 
University.) 


Within the past twenty years, sociologists and educators have started to 
look more closely at both the distinctive traits of collectivities and the 
consequences that membership in particular groups can have on attitudes 
and behaviours of their respective members. Much of this work has been 
devoted to identifying and isolating the ‘‘contextual effects” that account for 
variability in post-secondary aspirations of similar individuals in 
distinctively different groups. The contextual effect itself concerns the 
socialization of an individual with a specific environmental setting or milieu. 


The importance of such research has generally been related to the fact 
that pupil populations of a region, or school, can be a useful resource in the 
accomplishment of certain educational objectives. For example, Coleman, 
Campbell, Hobson, McPartland, Mood, Weinfeld, and York (1966) suggested 
that there were optimal patterns of student allocation, in some districts, 
which could materially assist in the attainment of particular goals (i.e., 
through busing students to affect an integration of certain racial, ethnic, and 
socioeconomic groups). Hence, the magnitude of contextual effects, and 
specifically their magnitude when compared with that of other more easily 
manipulative variables, should be of considerable interest to researchers and 
practitioners of social and educational theory. 
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In this study, an attempt is made to examine differential levels of 
educational and occupational aspirations among youth in different school 
contexts in the Regional Municipality of Durham (Durham Region), Ontario, 
Canada. More precisely, the focus of this investigation is towards a 
contextual explanation of variability in high school seniors’ post-secondary 
educational and occupational aspirations. This explanation is presented by 
way of construction and quantification of a linear regression model which 
incorporates influences due to certain personal and social psychological 
variables and the social-structural variable school context. 


Background to the Study 


Although school contextual effects may empirically conform to a variety 
of patterns, they have generally been interpreted in terms of reference group 
theory. According to this interpretative paradigm (Kelly, 1968; Kemper, 
1968), a person adjusts his behaviour and beliefs to the current modes and 
codes that seem to be most salient within his membership group(s). The 
major operational assumption of this theoretical interpretation is that 
specific individuals or collectivities, commonly referred to as significant 
others, are of such internal importance that the subject tries to meet the 
expectations which are thought to be especially needed in order to be socially 
acceptable. As Kemper (1968) states, 
Achievement is powerfully related to the types of reference groups available to an 
individual, and that lacking these groups, his achievement striving will be 
seriously hampered. (p. 31) 

Kemper (1968) continues, 
In general, a reference group is a group, collectivity, or person which the actor 
takes into account in some manner in the course of selecting a behavior from 
among a set of alternatives, or in making a judgment about a problematic issue. A 


reference group helps to orient the actor in a certain course, whether of action or 
attitude. (p. 32) 


Thus, the interpretative paradigm assumes that an individual or a mass of 
individuals (role exemplar{s]) with certain values and/or goals must be 
present within a given context (in this case, school environment) if 
interpersonal influences are to operate effectively. This critical normative 
mass should, in fact, provide the circumstances under which certain types of 
actions are deemed worthy of emulation. 


In terms of aspirations, then, particular sets of collective orientations may 
promote or inhibit individual student’s interest and engagement in future 
academic endeavours, and in this way serve as resources in the educational 
and occupational decision making process (i.e., whether to undertake 
post-secondary education, and at what level). In other words, school 
contextual effects on levels of aspiration are assumed to depend upon the 
responses of individual students to the presence or absence of predominant 
trends in the plans of “generalized others” in the high school. 

To date, several studies (Boyle, 1966; Coleman et al., 1966; Hauser, 1969; 
McDill & Coleman, 1965; McDill, Meyers, Jr., & Rigsby, 1967; Michael, 1961; 
Myer, 1970; Nelson, 1972; Turner, 1964; Wilson, 1959) have attempted, in 
varlous ways, to test this paradigm in different school settings. From a 
review of the literature, two facts emerge rather clearly. First, there is little 
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consensus among the researchers. On the one hand, studies by Boyle (1966), 
Coleman et al. (1966), McDill and Coleman (1965), Michael (1961), Turner 
(1964), and Wilson (1959) maintain that school effects, albeit small in some 
instances, are important factors in determining differential levels of 
achievement or aspiration. On the other hand, studies by Hauser (1969), 
McDill et al. (1967), Myer (1970), and Nelson (1972) claim that the magnitude 
of school environmental effects are negligible factors in determining 
students’ overall levels of achievement or aspiration. 


Secondly, there has generally been a lack of Canadian contextual studies. 
Indeed, of all the works cited, only one (Boyle, 1966) used Canadian subjects. 
The need for more comparative Canadian research has been continually 
stressed over the years by concerned social scientists. As far back as 1957, for 
example, Fleming, in having to base his conceptualization extensively on 
American studies, regretted this state of affairs and implied the need for 
systematic local research. He subsequently participated in one of the 
pioneering Ontario efforts on the subject of post-secondary education 
(Atkinson Study of Utilization of Student Resources, 1957) which helped 
unravel some of the specific processes impinging on educational aspirations 
among Ontario students. Still, eight years passed, and matters did not 
drastically change. As Fleming (1965) protested, 

It must also be kept in mind that most of the thousands of studies reported have 
been carried out in the United States. It is characteristic of this type of study that 
local factors play an important part... . It is hard to discover a creditable reason 
why investigation in Canada should have lagged behind. (pp. 8-9) 
More recently, Pike (1970) reiterated the Canadian case by stating that, 
“Unlike the United States, Canada has not been anxious to turn the light of 
empirical social research on educational problems and processes” (p. 10). 


In summary, then, there is need for more local Canadian contextual 
studies because: (1) generalizations are hampered by disagreement in the 
literature, and (2) there has generally been a lack of Canadian contextual 
research. 


Development of the Model 

General Design 

The problem of isolating social-structural effects from individual effects is 
of crucial concern in contextual research. Spurious environmental effects, 
known as statistical artifacts, can result unless the following two factors are 
considered. First, there is the problem of intercorrelations among the 
individual control variables and the contextual independent variable (school 
environment). Such intercorrelations are referred to as multicollinearity 
(Gordon, 1968, p. 596). Stated in interrogative terms, the problem becomes: Is 
school environment independently associated with both educational and 
occupational aspirations, or are the zero-order associations an artifact of the 
common relationship of aspirations and school environment to one or more 
of the individual control factors? 


Secondly, there is the problem of contaminative effects due to the uneven 
distribution of certain individual attributes within different structural units 
of the contextual independent variable. For example, intelligence (mental 
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ability) has been shown to be differentially related to levels of aspiration. 
That is, more students with high mental ability ratings aspire to a university 
education than do those with low mental ability ratings (Berdie, 19538; 
Breton, 1972; Bordua, 1960; Kahl, 1953; Pavalko, 1967; Sewell, Haller, & 
Straus, 1957). If mental ability ratings within each of the different structural 
units are unevenly distributed, then mental ability ratings could be either an 
advantage or a disadvantage for a particular school. Stated in question 
format, the problem becomes: Do subjects in school environment A have 
fewer university aspirations than subjects in school environment B because 
of contextual effects within their respective high schools, or is the uneven 
distribution of intelligence within structural units the main factor? 


The first problem can be solved by employing a technique which will sort 
out the unique variance accounted for in the dependent variables by school 
environment. This can only be accomplished by holding all relevant 
individual control variables constant simultaneously. Similarly, the second 
problem necessitates the use of multiple controls. All categories of the control 
factors known to be differentially related to aspirations, while at the same 
time suspected to be unevenly distributed within units of the independent 
variable, must be equalized or matched within all units of independent 
variable. 


According to Tannenbaum and Bachman (1964), multiple regression is 
one statistical procedure which can greatly minimize spurious effects in 
contextual studies. It will not only sort out collinearity effects among the 
independent and control variables, but it will also eliminate contamination 
effects within categories of the independent variable. In multiple regression, 
the analysis becomes the evaluation and measurement of overall dependence 
of a criterion variable on a set of other variables. The multivariate procedure 
used in this study is part of the Statistical Package for the Social Sciences 
(SPSS) (Nie, Hull, Jenkins, Steinbrenner, & Bent, 1975). 


Identifying the Control Variables 


In addition to intelligence, four other variables are known to be 
differentially related to levels of aspiration: family socioeconomic status, sex, 
parental expectations, and school peer group. Evidence indicates that family 
socioeconomic status, however measured, is clearly related to differential 
levels of aspiration. In short, students from middle class socioeconomic 
households are considerably more likely to plan on university and seek high 
status jobs than those from working class socioeconomic households (Anisef, 
1973; Breton, 1972; Clark, Cook, Fallis & Kent, 1975; Kahl, 1953; Marsden & 
Harvey, 1971; Pavalko & Bishop, 1966a; Porter, Porter & Blishen, 1973; 
Sewell et al., 1957; Sewell & Shah, 1968b; Wilson & Buck, 1960). 


Furthermore, male students are, on the average, much more likely to have 
higher educational and occupational aspirations than female students 
(Anisef, 1973; Fleming, 1957; Haller & Sewell, 1957; Karmel, 1975; Marsden & 
Harvey, 1971; Pavalko & Bishop, 1966a; Porter et al., 1973; Sewell, 1964). 

Also, some social scientists have contested that family socioeconomic 
status is at best a crude indicator of the home’s influence on post-secondary 
aspirations of high school seniors. More explicitly, while it is recognized that 
socioeconomic class is strongly related to aspirations, it is believed that 
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parental encouragement (expectations) for higher education and occupation 
can be an overriding class factor. In fact, a number of works dealing with the 
importance of the family as a socializing agency have taken the stance that, 
even though many working class families are relatively disadvantaged, they 
nonetheless raise their children according to the pattern more commonly 
~ associated with the middle class ideal (Boocock, 1966; Bordua, 1960; Cohen, 
1965; Sewell & Shah, 1968a, 1968b; Rehberg & Westby, 1967). 


Berdie (1954), Breton (1972), and Elder (1965) carry this theme one step 
further. They maintain that family influences on adolescents’ attitudes 
towards university and the professions provide a greater impetus for 
university attendance than the general economic facilities of the family. 
Evidence from studies by Kahl (1953) and Simpson (1962) tend to support this 
claim. 


Kahl studied a sample of twenty-four working class boys from public high 
schools in Metropolitan Boston, and found that the crucial differences 
between the twelve who aspired to go to university and the twelve who did 
not was the amount of parental pressure for university attendance among the 
former group. Parental pressure was almost nonexistent among the boys who 
lacked higher educational aspirations. Likewise, Simpson, in a southern 
United States study of occupational aspirations of high school seniors, found 
that for both middle and lower class boys, parental influence was more 
strongly related to aspirations than socioeconomic class. Simpson (1962) 


states, 
... parental advice is a much better indicator of high ambition than is the boy’s 


social class. Only 21.0 per cent of the unambitious middle-class boys and 16.0 per 
cent of the nonmobile working-class boys had been advised by one or both parents 
to enter professions; this compares with 53.1 per cent among the ambitious 
middle-class boys and 43.5 per cent among the mobile working-class boys (p. 519). 


It would seem, therefore, that parental expectations are more accurate 
barometers of family influence than is the socioeconomic class of the family. 
Indeed, parental encouragement can become a vehicle through which the 
parents translate their achievement and mobility values into a role 
expectation comprehensible to the adolescent—that he or she is to continue 
his or her education beyond high school. 

Thus, to type students solely on the basis of their socioeconomic class, as 
some contextual studies have done, may provide an extremely inaccurate 
picture of the home environment. At the same time, however, the inclusion of 
the variable, parental expectations, should not exclude the variable, family 
socioeconomic status. With parental expectations controlled, family 
socioeconomic status merely becomes a more precise measure of one feature 
of the home milieu relating to education, namely, the type of role model that 
the parents present for the child to emulate. 


Finally, the variable, school peer group, was included in the model as an 
alternative mechanism for linking the social-structural unit to the individual. 
The variable has been shown to be highly correlated with both levels of 
achievement and aspiration (Alexander & Campbell, 1964; Bain & Anderson, 
1974; Beilin, 1956; Duncan, Haller & Portes, 1968; Haller & Butterworth, 
1960; Herriott, 1963; Pavalko & Bishop, 1966b). More precisely, the more 
post-secondary bound peers a student perceives as having (specific others), 
the more likely that student will want to pursue a post-secondary education. 
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Hence, it was speculated that school peer group might have unique and 
statistically significant effects on aspirations separate from those of the 
other control variables and the independent variable school environment. 

This study remains unparalleled, then, in that it examines the school 
social structural question from two perspectives: generalized others and 
specific others. That is, social interaction effects on aspirational levels are 
not only measured and assessed indirectly through the dominant 
socioeconomic composition of the high school population (contextual effects), 
but directly through the control variable school peer group (sociometric 
effects). 


Quantification of the Model 


The regression model includes five control variables, one independent 
variable, and two dependent variables. The operational definition of each 
variable is presented along with a brief description of how it was measured. 
The five control factors are: mental ability (intelligence), family 
socioeconomic status, sex, parental expectations, and school peer group. 


Mental ability. The variable intelligence was based on scores from the 
Culture Fair Intelligence Test (Cattell & Cattell, 1960a, 1960b). High, 
medium, and low mental ability ratings were obtained by dividing all raw 
scores into near equal thirds. 


Family socioeconomic status. Students were divided into low, middle, and 
upper socioeconomic status levels according to their father’s occupational 
rating(s) on Blishen’s Socioeconomic Index for Occupations in Canada 
(Blishen, 1971). Three socioeconomic classes were arrived at by combining 
seventy, sixty, and fifty ratings into high socioeconomic status (upper class), 
forty ratings into medium socioeconomic status (middle class), and thirty 
and twenty ratings into low socioeconomic status (working class). 

Sex. For computer purposes, males were assigned a one and females were 
given a two. 

Parental expectations. Students were asked to respond to questions which 
recorded their perceptions of their parents’ attitude toward their future plans. 
The variable was trichotomized into low, medium, and high encouragement 
categories. The low category signifies no or little post-secondary expectation, 
the middle category indicates diploma granting colleges, and the high 
category includes degree granting universities. 


School peer group. The variable was dichotomized into positive and 
negative socializing effects. Students who indicated that “most to all” of 
their friends were planning on a post-secondary education were considered to 
have a positive peer group impetus. Contrariwise, students with “few to no” 
post-secondary oriented friends were classified as having a negative peer 
group impetus. 

The contextual independent variable, school environment, was based 
upon the dominant socioeconomic composition of the high school student 
body. Schools with predominately middle and upper class clientele were 
classified as having middle class milieus. Schools composed of mainly 
working class students were considered to have working class environments. 


This information was first obtained from the principals and later verified by 
tabular analysis. 
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The two dependent variables in the study are educational and 
occupational aspirations. The former variable refers to students’ 
post-secondary educational plans while the latter variable refers to students’ 
vocational or occupational ambitions after school. 


The concept, level of aspiration (Haller & Miller, 1971; Haller, Otto, Meier 
& Ohlendorf, 1974), includes an orientation towards a goal. The person’s goal 
is a selection of one among the alternative behaviour levels that are possible 
with respect to an objective. The alternative behaviour levels must vary in 
the degree to which they are difficult to achieve. That is, the alternatives 
must be ranked in a continuum of difficulty. The person’s choice is fluctuant 
in that its central tendency may lie at any point or limited range of points 
along the continuum of difficulty. The central tendency of the person’s 
orientation is that point or limited range of points which has the highest 
valence for that individual. This is the person’s level of aspiration. The term 
differential level of aspiration logically implies variation in the point of 
valence when it is estimated at different times on the same person, or at same 
or different times on different persons. In this study, the term is restricted to 
variations among different persons at the same time. 


The concept “level of occupational aspiration” (LOA) is a special instance 
of the more general term. It differs from the general concept only in that it 
takes as its object the occupational hierarchy, and the continuum of difficulty 
consists of the various levels along the hierarchy. Level of occupational 
aspiration was measured by Haller and Miller’s (1971) Occupational 
Aspiration Scale (OAS). The range of possible scores on the OAS is from zero 
to seventy-two. Low, medium, and high levels of occupational aspiration 
were obtained by dividing all raw scores into approximate thirds. 


The concept “level of educational aspiration” (LEA) is likewise a special 
case of the more general concept. It differs from the general concept only in 
that it takes as its object the post-secondary educational hierarchy, and the 
continuum of difficulty comprises the various levels along the hierarchy. The 
hierarchy contained nine alternatives: trade school with an apprenticeship 
in the field, agricultural college, business school or college, institute of 
technology, military college, the Ontario College of Art, theological 
seminary, community college, and university. 

A threefold classification system developed in which the definitely 
decided and partially decided were given precedence over the undecided. A 
university choice definitely or partially decided was given a high rating. An 
undecided university choice was given a medium rating. Selection of a 
diploma granting college was also given a medium rating if the student 
indicated either a definite or partial decision to attend. Low ratings were 
given to college undecideds, to those who had no post-secondary aspirations, 
and to those who planned to pursue a trade with an apprenticeship in the 
field. 

A schematic representation of the complete regression model is illustrated 
in Figure 1. Possible intercorrelations between the five control factors are 
shown by arrowed lines. The bar line between the control variables and the 
independent variable, school environment, is intended to show that there is 
only one independent variable in the model, and that high schools, City W1, 
City M1, Town W2, and Town M2, are merely structural units of this variable. 
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Control Variables Independent Variable Dependent Variables 
(Possible intercorrelations among the five factors) (School Environment) (Aspirational levels) 


Socioeconomic status + 
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Figure 1. The complete regression model. 


The letters W and M stand respectively for working class and middle class 
socioeconomic environments. Finally, the inverted pyramids capped with 
arrows between the four school units and aspirational levels indicate that 
there are two dependent variables in the model, namely, educational 
aspirations and occupational aspirations. 


The Hypothesis 


The model tests the premise that the bulk of students in a high school 
(indexed by the dominant socioeconomic composition of the student body) 
provides a significant normative influence on students’ overall levels of 
aspiration when relevant control factors are simultaneously controlled. In 
general terms, the inquiry asks: 


1. Does school environment have a significant influence on students’ 
post-secondary aspirations? 


2. Does the zero-order association persist when relevant control factors 
are simultaneously controlled? 


3. How much variance does school environment account for over and 
above that accounted for by the control variables? 


In response to the questions, the alternative hypothesis assumes the 
affirmative. That is, school environment will have a significant influence on 
students’ post-secondary aspirations. More precisely, 

Students will have, on the average, higher educational and occupational 


aspirations in middle class high school environments than in working class high 
school environments. 


Methodology 
Study Site 


The study was conducted in high schools under the jurisdiction of the 
Durham Board of Education in the Regional Municipality of Durham 
(Durham Region), Ontario, Canada. Durham Region stretches from Lake 
Ontario in the south to the Talbot River (east shore of Lake Simcoe) in the 
north and from Northumberland and Victoria counties on the east to 
Metropolitan Toronto and York Region on the west. 


. The region itself can be divided into two geographic and economic zones: a 
highly industrialized, urbanized, southern corridor and a northern rural 
farming area. There are no ethnic enclaves in the region, although there has 
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been a small influx of new immigrants in the last ten years. Low rental 
housing is minimal, as are elite dormitories. The small-towns to the north 
and the larger cities to the south serve as drawing areas for shopping, 
recreational, and business activities. 


The region and surrounding areas offer its citizens ample opportunities 
for post-secondary education. Within easy driving distance are three 
universities and three community colleges. Trent University is located to the 
east at Peterborough, while Scarborough College, University of Toronto, is 
located to the immediate southwest. Durham Community College is located 
within the Oshawa-Whitby corridor, while to the east again at Peterborough 
is situated Sir Sandford Fleming College with a branch campus at Lindsay. 
To the north, at Orillia, is Georgian Community College and the Wilfrid 
Laurier University extension program. 


Data Collection 


Four high schools were selected according to the following four criteria: (1) 
they had to have near equal enrollments, (2) they had to offer the three main 
streams or programs up to and including grade twelve (academic, vocational, 
and business and commerce), (3) the overall student body socioeconomic 
composition had to be predominately working class in one of the small-town 
and city high schools and predominately middle class in the other two high 
schools, and (4) all high schools had to be under the same administrative 
unit. This helped minimize resource differences in schools by placing some 
control over the distribution and utilization of various facilities within 
structures. In addition, it gave a degree of uniformity to such variables as 
student-teacher ratio, guidance and counselling aids, methods of evaluation 
and promotion, as well as to the operational goals underlying school 
curriculums. : 

The Occupational Aspiration Scale (OAS), the Culture Fair Intelligence 
Test, and the Survey Schedule (a self-made instrument) were administered to 
all grade twelve students within each of the four high schools in the late fall 
and early winter of the 1975-76 school year. In total, the researcher polled 
seven hundred and fifty students. This represented a return of approximately 
ninety per cent across the four clusters. 


Data Analysis 

Two types of analyses, using the Statistical Package for the Social 
Sciences (SPSS) (Nie et al., 1975), were performed on the data: tabular and 
multiple regression. Tabular analysis established zero-order relationships 
between each of the control factors and the independent variable and the two 
dependent variables respectively. Zero-order correlations, using chi-squares, 
confirmed or refuted statistically significant relationships, and control factor 
distribution within categories of the independent and dependent variables 
were ascertained by an examination of individual tables. In addition, 
Crameyr’s correlation coefficient (Blalock, 1972, p. 230) was employed to 
approximate the strength of each statistically significant association. One 
could then test for linearity, a prerequisite in the use of linear regression, and 
assess the rough magnitude of each control factor on the independent and 
dependent variables. 
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Although tabular analysis was useful in mapping out separate effects, 
there was need for a more precise estimation of the relative influence of each 
variable in terms of a standardized measure. Further, this measure had to be 
strictly comparable from one stage to another while taking into account the 
full range of available data. Multiple regression, in the form of one-way 
analysis of variance, with option nine (SPSS, ANOVA), met this requisite. In 
option nine, all effects, including main effects, covariate effects, and any 
interaction effects are assessed simultaneously as in straight multiple 
regression (SPSS, REGRESSION). Each effect is the additional contribution 
to the explained variation in the dependent variables after adjustment has 
been made for all other effects. 


The values of the dependent variables were predicted from the linear 
function of the form, 


'=A+BiXi+BoX+... BoXe, 


where Y’ represents the estimated value of Y, A is the Y intercept, and Bi are 
partial regression coefficients. Bi stands for the expected change in Y witha 
change of one unit in X; when X>to Xs are held constant. Expressed in 
another way, B: is the expected difference in Y between two groups which are 
different on X: by one unit but are the same on X2to Xe. Similarly, Bs (the 
independent variable) stands for the expected change in Y with a unit change 
in X6 when X, to Xs are held constant (Gordon, 1968; Kerlinger, 1973). 


The A and Bi coefficients were selected so that the sum of squared residuals 
> (Y-Y’)? is minimized. This least squares criterion suggests that any other 
values of A and Bi would yield a larger > (Y - Y’)?. Further, selection of the 
optimum A and B; coefficients using the least squares criterion intimates that 
the correlation between the actual Y values and the Y’ estimated values is 
maximized, while the correlation between the independent variables (controls 
included) and the residual values (Y - Y’) is reduced to zero. 


The overall null hypothesis, Ho: R = 0, is equivalent to the null hypothesis 
that all k regression coefficients are equal to zero in the population. More 
precisely, Ho: Bi = B2 =... Bk = 0. The alternative hypothesis, stated in terms of 
population regression coefficients, is Hi: Bi# 0 for one or more i. (For a 
detailed discussion of multiple regression analysis, see Jae-On Kim and 
Frank J. Kohout in Nie et al., 1975, pp. 320-342.) 


Results 


A series of regression equations were employed to measure rough and 
refined effects of high school environment on both educational and 
occupational aspirations. The first equation assessed the gross magnitude of 
school effects on each dependent variable. The words “gross magnitude” are 
used because the individual factors were not controlled for either separately, 
successively, or simultaneously. Results are shown in Tables 1 and 2. 

Although both F values in Table 1 are greater than one, neither is 
statistically significant. Thus, even with the control factors excluded, school 


environment does not have a statistically significant relationship with either 
dependent variable. 
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TABLE 1 
ONE-WAY ANALYSIS OF VARIANCE WITH OPTION 9, OCCUPATIONAL 
AND EDUCATIONAL ASPIRATIONS BY HIGH SCHOOL, 
ALL COVARIATES EXCLUDED? 


Sum of Squares D/F F Value p* 
Occupational Aspirations 
Environmental effects 356 Si) 5} 1.88 21350 
Residual AOle225 726 
Total 484.962 729 
Educational Aspirations 
Environmental effects DSwrL 3 LESS) A Ke) 
Residual AD Oe DO, 726 
meoitarl: 456.088 729 


* 750 cases were processed. 20 cases (2.7%) were missing. 


* The statistical significance of the relationships examined throughout the 
analysis is determined by appropriate tests using the .05 probability 
HeviCls (Olgs s.105)). 


TABLE 2 
UNADJUSTED DEVIATIONS OF THE FOUR HIGH SCHOOLS AROUND 
THE GRAND MEANS OF OCCUPATIONAL AND 
EDUCATIONAL ASPIRATION 


Occupational Unadjusted Educational Unadjusted 
Aspirations Deviations Aspirations Deviations 
12 City W1 -0.02 (3.)* lee City Wi OfOLUGa!) 
eee Cichy Mil: O03 C2") 2a Galt yeh. 0s (Cah) 
See clowns WZ 51S) (CAs) 3. Town W2 -0.07 (4.) 
4. Town M2 OvO Sars) 4. Town M2 OOD Els) 
Grand Mean = 2.00 Grand Mean = 2.18 
Multiple R2= .008 Multiple R2= .005 
Multiple R= .088 Multiple R = .074 


? In this and all successive tables, numbers in parentheses give the 
hierarchical order of each structural unit. 


The nonsignificant contributions of each structural unit are given in 
Table 2. The unadjusted deviations are the means of each structural unit 
expressed as deviations around the grand means of occupational and 
educational aspiration. The middle class small-town high school (Town M2) 
ranks high on both dependent variables, while the working class small-town 
high school (Town W2) ranks low on both dependent variables. In contrast, 
the middle and working class city high schools (City M1 and City W1 
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respectively) each have a two and a three rating. At the same time, however, 
the spread between structural units is extremely small because school 
environment accounts for only 0.8 per cent (.008 x 100 = .80) of the variance in 
occupational aspirations and 0.5 per cent (.005 x 100 = .50) of the variance in 
educational aspirations. 

In order to test the complete regression model, a second series of equations 
were generated to see: (1) which control variables were accounting for most of 
the variance in the dependent variables, (2) if the order of structural units 
around the grand means would remain constant, and (3) whether school 
environmental effects would partially remain or completely disappear. 


Tables 3 and 4 summarize the results of occupational aspiration by school 
environment with the five covariates (control factors) simultaneously 
controlled. Table 3 reveals that, of the five covariates, three have statistically 
significant relationships with occupational aspirations. Included are 
parental expectations, family socioeconomic status, and school peer group. 
The other two control factors, sex and intelligence, are equally weak as 
neither has an F value greater than one. School environment itself has an F 
value greater than one, but its association with occupational aspirations is 
not statistically significant. 

The unreliable contributions of each structural unit are displayed in 
Table 4. The unadjusted and adjusted deviations are the means of each 
structural unit expressed as deviations from the grand mean of occupational 
aspirations. The adjusted hierarchical order of the structural units in Table 4 
is the same as that of the unadjusted order in Table 2. 


The multiple R squared value of 0.266 (27 per cent) is the total amount of 
variance accounted for in the dependent variable by the additive effects of all 
the common and unique contributions of each covariate and the independent 
variable. The multiple R value (0.516) gives the strength of the multiple 
relationship between the dependent variable and the independent variable 
plus the five covariates. In this case, it is a moderately strong positive 
relationship. 


Eta squared (7 )? is the proportion of variance in the dependent variable 
explained by both the unique contribution of school environment plus the 
common or shared contributions of school environment and the five 
covariates. More precisely, it includes both the overlap variance within the 
independent variable and the covariates plus the independent variable’s 
unique contribution. Beta squared (B)?, on the other hand, is the exclusive 
contribution of school environment devoid of the covariate overlap. 
Accordingly then, school environment makes a .008 x 100 = 0.8 per cent joint 
contribution to the variance in occupational aspirations. However, when 
adjustments are made for the five covariates, its unique contribution drops to 
0.4 per cent (.004 x 100 = .40). 


Tables 5 and 6 present the results of educational aspiration by school 
environment with the five covariates simultaneously controlled. Once again, 
three of the covariates, parental expectations, family socioeconomic status, 
and school peer group, have statistically significant relationships with the 
dependent variable. Although sex has an F value greater than one, it is not 
statistically significant. Intelligence and high school environment are the 
weakest factors having F values of less than one. 
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TABLE 3 
ONE-WAY ANALYSIS OF VARIANCE WITH OPTION 9, OCCUPATIONAL 
ASPIRATION BY HIGH SCHOOL CONTROLLING SIMULTANEOUSLY 
FOR THE FIVE COVARIATES? 


Independent Variables Sum of Squares D/F F Value Pp 
Covariates WA 5 SyAe) 5 48.660 O0O1 
Parental expectations 36.428 a 73.049 001 
Family socioeconomic status 14.005 1 28.083 001 
Intelligence O 5037 1 0.065 pn sieieiee 
School peer group 16.068 il 325220 001 
Sex 0.108 1 OaPily 999 
High School Environment 1.784 8 I TES)8} ould 
Residual 338.106 678 

Total 462.964 686 


* 750 cases were processed. 63 cases (8.4%) were missing. 


* If the F value is not greater than 1, SPSS does not compute the p value. 
Instead, it prints .999. 


TABLE 4 
UNADJUSTED AND ADJUSTED DEVIATIONS OF THE FOUR 
HIGH SCHOOLS AROUND THE GRAND MEAN OF 
OCCUPATIONAL ASPIRATIONS 


Structural Units Unadjusted Deviations Adjusted for 
Covariate Deviations 


DeeeCLey. Wl -0.03 (3.) -0.03 (3.) 

2. City U1 0-03 C2:) 0: 039422) 

3. Town W2 -0.12 (4.) -0.09 (4.) 

4. Town M2 DEO (ays) OOS Gl) 
Multiple R2 = .266 


Multiple R = .516 
Eta? = (0.09)2 = .008 
Beta2= (0.06)2 = .004 
Grand Mean = 2.00 


Summary Measures 


The nonsignificant effects of each structural unit are illustrated in 
Table 6. The adjusted hierarchical order of the structural units in Table 6 is 
identical to the unadjusted order in Table 2. This time, school environment 
makes a 0.5 per cent joint contribution to the variance in educational 
aspirations, but its unique contribution drops to only 0.2 per cent when 
adjustments are made for the five covariates. Total amount of variance 
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TABLE 5 
ONE-WAY ANALYSIS OF VARIANCE WITH OPTION 9, EDUCATIONAL 
ASPIRATION BY HIGH SCHOOL CONTROLLING SIMULTANEOUSLY 
FOR THE FIVE COVARIATES? 


Independent Variables Sum of Squares D/F F Value Pp 
Covariates Sy reaOLe 5 78.631 001 
Parental expectations 83.028 1 207.437 -001 
Family socioeconomic status 3.040 Il Us S)s 006 
Intelligence 0.369 id OF9 23) 3999 
School peer group 9.435 1 2a eau -001 
Sex 0.794 il 1.984 LG 
High School Environment 0.653 3 0.544 7999 
Residual 27 eS 678 

Total 431.005 686 


* 750 cases were processed. 63 cases (8.4%) were missing. 


TABLE 6 
UNADJUSTED AND ADJUSTED DEVIATIONS OF THE FOUR 
HIGH SCHOOLS AROUND THE GRAND MEAN OF 
EDUCATIONAL ASPIRATIONS 


Structural Units Unadjusted Deviations Adjusted for 
Covariate Deviations 


Tea Clty wi 0.09.25) OLOL A222) 
Zuma C Ue ye -0.04 (3.) -0.02 (3.) 
3. Town W2 -0.08 (4.) -0.04 (4.) 
4. Town M2 0.07) Gl.) O20 5, (le) 
Multiple R* = .367 
Multiple R = .605 
Summary Measures Meee = (Oe = .OOs 


Beta2 = (0.04)2 = .002 
Grand Mean = 2.117 


accounted for is quite substantial (37 per cent). The multiple R of .605 
suggests a strong positive multiple relationship between the dependent 
variable and the independent variable and the five covariates. 


Discussion 


The findings of this Canadian study are congruent with past American 
research which found that school contextual effects are extremely minimal. 
Students in the two middle class high school environments do not have, on 
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the average, higher levels of aspiration than students in the two working 
class high school environments. School environment, as reflected in the 
dominant socioeconomic composition of a high school, does not account for 
much of the variance in either educational or occupational post-secondary 
aspirations. Indeed, even with the control factors excluded, school 
environment never contributed to more than one per cent of the variance in 
either dependent variable. Thus, it can be concluded that, in this study, 
school environmental effects are not only statistically meaningless, but 
substantively weak. 


This conclusion implies, then, that the administrative strategy of 
attempting to balance the socioeconomic compositions of a region’s high 
schools, at least in order to influence aspirations, may not be particularly 
effective. In fact, these findings suggest that no effect at all may result when 
balancing is done across schools. Yet, this is decidedly not to say that such 
balancing should not be attempted since there may be factors which should 
be considered other than the one of aspirations. For example, self- and 
school-related attitudes may be more susceptible to administrative 
manipulation of school contexts. Of course, this is mere speculation as there 
is no evidence from this study to support such a claim. 


Despite the small contributions of school environment, overall the 
regression model does have good explanatory powers. The range of total 
variance accounted for varies from a low of 27 per cent to a high of 37 per 
cent. This definitely indicates, therefore, that the model contains many 
pertinent personal and social psychological variables. 


Three control factors, in particular, accounted for most of the variance: 
parental expectations, family socioeconomic status, and school peer group. 
All three variables had statistically significant associations with both 
dependent variables. Strictly speaking, however, the joint contributions of 
parental expectations and family socioeconomic status continually 
accounted for more variance in aspirations than all the other factors 
combined. Hence, as far as this study is concerned, it appears as though the 
heart of the matter is in the conjugal (nuclear) family where many 
behavioural attitudes and values are engendered which lead to high 
aspirations and subsequent social class achievement by members of the next 
generation. 


It is also interesting to note that the control variable, school peer group, 
accounts for considerably more variance than school environment. This 
finding would seem to infer that, even though school environment 
(generalized others) in and of itself may not be an important factor, the 
adolescent clique or subgroup (specific others) is an important factor in 
determining differential levels of post-secondary aspirations. More explicitly, 
it is the student’s direct associates within the smaller school society that is 
the important factor (sociometric effects), not the overall socioeconomic 
composition of the school population itself (contextual effects). Future 
contextual studies might take this finding into account by shifting their 
primary focus from general interpersonal collectivities to specific personal 
collectivities. 

The other two control factors, sex and intelligence, did not have 
statistically significant associations with either dependent variable. Since 
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these findings contradict much previous research, a posteriori intuitive 
explanation is warranted. With the advent of women’s liberation and the 
promotion of equal educational and vocational opportunities for both sexes, 
aspirational differences between the sexes may no longer be an important 
factor. In fact, evidence from this study suggests that girls are not confining 
their aspirational levels to society’s traditional expectations. 

Similarly, intelligence (mental ability) may no longer be an important 
factor in aspirational type studies. With “educational inflation” running 
rampant, average or below average intelligence may not be seen as a barrier 
to university attendance and the highly skilled professions. Perhaps the 
criteria of academic success has shifted from high intelligence to 
determination, dedication, hard work, and enthusiasm. 


Of course, the order in which the variables are entered in the regression 
equation affects the magnitude of each factor. More precisely, if sex and 
intelligence were entered first, they would account for more variance than 
they do in their present position. Regardless of position, however, neither 
variable makes a significant contribution to the variance in either dependent 
variable. 


Variable ordering likewise accounts for some of parental expectations’ 
large contributions, but overall it remains the most important factor in the 
model. At the same time, caution must be exercised since data on this 
variable were obtained from students rather than independently from the 
parents. The danger of student report measures lies in the fact that students 
may attribute to their parents the aspirations they hold for themselves. In 
other words, questionnaire responses could become reflections of what the 
students want their parents to project rather than what the parents actually 
expect. 


According to Williams (1972, p. 118), however, evidence for the validity of 
student report data, although minimal, is nonetheless encouraging. Williams 
noted two unpublished doctoral dissertations which found that 
approximately seventy-eight per cent of the students reported their mother’s 
expectations correctly. Thus, there is some support for the assumption which 
is made, namely that, even though student reports of referent 
expectations/ aspirations are in close concordance with reality, they may be 
slightly overstated. 


In the last analysis, it is expected that the dimensions employed in this 
model will provide a focus for further Canadian contextual research. Other 
factors unique to the home, school, and community need to be identified and 
isolated as a substantial proportion of the variance in both dependent 
variables remains unexplained. Hopefully, such research will lead to a 
clearer understanding of what types of contexts are most conducive to high 
post-secondary aspirations among Canadian youth in contemporary society. 
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University of Alberta Committee for the 
International Year of the Child 


Call for Papers for an IYC Conference 


The University of Alberta Committee for the International Year of the Child is 
currently planning an interdisciplinary conference at the end of September, 
1979. The conference is intended to bring together individuals working in a 
Canadian context, who are interested in the study of the child, for the purpose of 
sharing information, generating ideas, exploring what they have in common 
and how they can support each other. The programme themes include the 
developing child, the relating child, the learning child, and the child with special 
needs. Research papers are invited under the categories of: education, health, the 
child and the law, child growth and development, the child in the family, 
children’s environments, the child in society, the child and government, the child 
and the humanities, and the child and agencies. For further information, please 
contact Conference Chairperson, Sheila Campbell, Department of Elementary 
Education, Faculty of Education, University of Alberta, Edmonton, Alberta, 
T6G 2G5, or phone (408) 4382-5428. 
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The Effect of TV Viewing on the Educational 
Performance of Elementary School Children 


This study was designed to test the accuracy of the popular assumption that 
the extent of TV viewing has a negative effect on educational performance. 
Subjects in grades III and IV (N = 102) and Vand VI (N = 198) were assessed 
on 14 variables: sex, socioeconomic status, the verbal and nonverbal IQ taken 
from the Canadian version of the Lorge-Thorndike Test of Mental Ability, 
achievement in a locally-standardized test of mathematics, the word 
meaning and paragraph comprehension sections of the Stanford Reading 
Test, two measures of behavioural impulsivity, and five measures of TV 
viewing (serious news, situation comedies, violent programmes, “kidvid” 
cartoons, and total TV viewing). Two-way ANOVAS (8 levels of TV viewing x 
3 levels of IQ) disclosed no significant TV group effects on the three 
attainment variables. Minor findings concerned the significant positive 
relationship between extent of TV viewing and a test of behavioural 
impulsivity and, on one occasion, the present measure of social class. Future 
studies might investigate these points and eliminate some of the questionable 
methodological and measurement decisions made in this _ initial 
data-gathering venture. (Dr. Anderson and Dr. Maguire are Professors in the 
Department of Educational Psychology at The University of Alberta.) 


The assumption that the extent of TV viewing, estimated by the frequency 
with which TV shows are watched by youngsters, is negatively related to 
their verbal and nonverbal attainment in school, is based on a mixture of 
factual reports and the climate of contemporary opinion. 


(1) The assumption is tacitly accepted by the people who created “Sesame 
Street.” Children are not being given the education they deserve (Lesser, 
1975, p. 9), and “By the time these children graduate from high school, they 
will have spent on the average about 12,000 hours in school and about 15,000 
hours watching television, having devoted more of their young lives to 
watching television than to any other single activity except sleep” (p. 19). 
There is an inference to be drawn about the cause-and-effect relationship 
between Lesser’s statements. 
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(2) Teachers tend to believe that TV is the obvious culprit responsible for 
the decline in educational performance partly because it reduces the time 
available for homework and partly because youngsters would rather watch 
pictures than read print or listen to “talking heads” (Epstein, 1974, p. 263). 
Indeed salient contemporary models everywhere speak and write in a 
barbarous jargon characterized by, among other faults, ugly 
circumlocutions, plain bad grammar, clichés, the ambiguous use of words 
and phrases, and syntactical irregularities (Newman, 1976, pp. 4, 23, 30, 137). 
From this the inference to be drawn is that verbal rather than nonverbal 
attainment is more likely to be negatively affected by TV viewing. 


(3) It has been asserted and illustrated that the debilitating effects of TV 
have their greatest influence on low achievers (Crandall, 1977, p. 36), a point 
which is consistent with the finding that low-income adolescents watch more 
TV than their higher-income counterparts (Liebert & Schwartzberg, 1977, 
p. 143).! 

If a relationship between income and TV viewing exists, then support for 
the relationship could be obtained by the simple process of gathering 
appropriate information from a representative sample of subjects. 
Unfortunately, the assumption and the sampling procedure seem to be at 
odds with Liebert and Schwartzberg’s other finding that “the brightest 
children were the heaviest television users” (Liebert & Schwartzberg, 1977, 
p. 143). Accordingly, it was decided that if achievement declines as a function 
of TV viewing, the evidence must be found from an analysis of the 
performance and responses of “bright” students. 


A second hypothesis concerns the relationship between behavioural 
impulsivity and heavy viewing. This idea is derived from the view that 
commercial TV, as a marketing device, must promote impulse buying by 
associating its commercials with constantly changing and novel 
programmes. Moreover, impulsive children would seem to be excellent 
candidates for poor educational performance (Keogh, 1971, p. 106). 


Method 


Subjects 

Five classes of subjects were tested and a comparison between the sample 
means and the 1975 performance of the complete school population on the 
Lorge-Thorndike Test of Mental Ability is given in Table 1. It is clear that, 
because the sample distributions are negatively skewed, they contain the 
target students, although the comparability of the standard deviations 
presents a problem, one solution to which may be that the schools chosen 
drew their students from a variety of neighbourhoods. 


Measures 

The children tested came from grades III (N = 46, 19 females); IV (N = 56, 
28 females); V (N = 108, 54 females); and VI (N = 90, 46 females) in a 
metropolitan city in Western Canada. The sample was chosen because 
information about its members was accessible and because the students, on 
the average, were markedly superior, as has been shown, to the population 
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TABLE 1 


DESCRIPTIVE STATISTICS BY GRADE LEVELS FOR 
LORGE-THORNDIKE CANADIAN VERSION 


Grades 
ikarndueay) V and VI 
Mean SD Mean SD 
Verbal IQ Sample IAL LoS) IS O72 MAS) i514 
City Norms 108.9 WEA IMO) al iS 
Non Verbal IQ Sample WL 9) 14.5 114.6 T3538 
City Norms 108.0 Ss OZ UWSja2 


from which they were drawn. They were assessed on fourteen variables: sex, 
socioeconomic status, verbal and nonverbal IQ taken from the Canadian 
version of the Lorge-Thorndike Test of Mental Ability, achievement in a 
locally-standardized test of achievement in mathematics, the word meaning 
and paragraph comprehension sections of the Stanford Reading Test, two 
measures of behavioural impulsivity (Davids, 1971; Hirschfield, 1965), and 
five measures of TV viewing. The questionnaire? about television preferences 
offered the children 52 programmes, all of which were named by other 
children not participating in the investigation. Eleven programmes such as 
news and documentaries were classed as serious; there were 19 situation 
comedies, 14 violent programmes, mostly police shows such as “Kojak,” and 
8 “kidvid” cartoons with names like “Speed Bunny,” “Hong Kong Phooey,” 
and “Groovey Ghoolies.” Although the cartoons are primarily violent in 
nature, they were placed in a separate category because they are designed for 
children. 


The subjects were asked to place marks opposite programmes they 
regularly watched and were encouraged to write in favoured programmes not 
included in the list. For each child scores were derived for each of the shows 
by tallying the marks, and the total score consisted of all the marks made. 
That is, it was decided that the extent of TV viewing was reflected in the 
frequency with which TV programmes were watched. Alternative measures 
such as the number of hours watched were excluded because the child is 
better acquainted with the programmes watched rather than their temporal 
duration. 


Socioeconomic status was estimated by placing the occupational status of 
the father (or that of the mother if hers were higher) on a six-point scale 
defined in terms of the amount of formal education received by the parent. 
Those with the highest sorts of education, usually in professional and 
managerial posts, were rated as 6 and made up 10 percent of the parents of 
children in the younger grades, 6 percent in the older; lower professional and 
administrative parents were rated as 5 (25 percent and 30 percent, 
respectively); those with some post high-school education such as typists and 
lower civil servants as 4 (26 percent and 32 percent); skilled manual workers 
such as mechanics and plumbers as 3 (29 percent and 24 percent); unskilled 
manual workers as 2 (9 percent and 8 percent) and the intermittently 
employed and the unemployed as 1 (1 percent and zero). 


158 


Effect of TV Viewing on Educational Performance 


The hierarchy was arranged in terms of the extent of symbolic education 
because it was assumed, wrongly as it turned out, that people with the 
highest symbolic skills would be least likely to allow their children to watch 
TV a great deal, a possibility that would afford some support for the 
assumption of a positive relationship between TV viewing and educational 
attainment. 


Results 
TV Viewing 


The following analyses were carried out after combining grades III with 
IV and V with VI because the differences in amount of TV viewing between 
the two younger and the two older groups were not significant whereas the 
difference between the two age-groups was very significant. The 
intercorrelations among the TV variables were calculated to ascertain 
whether a general score for TV viewing would be appropriate for further 
analyses. The results are shown in Table 2. 

There are two interesting points about these correlations: a) the 
youngsters, especially in the later grades, tend to look at any kind of 
programme; b) it is a fair claim that, since 19 of the 20 correlations are 
significant and the correlations with the total are large, a general factor of 
TV viewing, measured by the total scores previously described, can be used 
as an independent variable for testing the accuracy of the idea that a 
negative relationship exists between number of TV shows viewed and 
educational attainment — at least, for this sample. 


Preferred Sort of Programme 

Information about the programmes selected most frequently in the 
categories is given in Table 3. 

A Newman-Keuls comparison of the means showed marked differences 
among means for each of the two groups. The preference for situation 
comedies is lower in the younger group — 12 out of 19 (64 percent) compared 
to 14 out of 19 (74 percent). Heavy watching of the violent fare occurred only 


TABLE 2 


CORRELATIONS AMONG VIEWING PREFERENCES FOR STUDENTS 
IN GRADES IJJ—IV AND V—VI?® 


KIDVID SI -145 7202 Sayre! SEY) 
SERIOUS (All == -492 - 396 - 630 
VIOLENT 435 376 Sse -448 Skew 
SITCOM 489 VA) ee SSS -822 
TOTAL VIEWING aif Sylk -616 ll MY spoil == 


I 


2 Grades III-IV correlations are shown above the diagonal (WN = 102). 
Grades V-VI correlations are shown below (WN = 198). 
(All correlations except .145 are significant at the OSmlevediuyre 
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TABLE 3 


AVERAGE NUMBER OF PROGRAMMES SELECTED FROM 
EACH CLASSIFICATION 


Number of 


Grade Mean SD Programmes 
Listed 

III-IV KIDVID 6.8 Sie? 8 
SERIOUS Siodl Pods ibs 
VIOLENCE 8.9 4.0 14 
SITCOM Je} 4.9 19 

V=ViE KIDVID 5.4 Sjo Ml 8 
SERIOUS Bo’ Died iLil 
VIOLENCE 8.9 Bes 14 
SITCOM Sew, 4.5 19 


with the “kidvid” in the younger group (7 out of 8 or 89 percent); these 
cartoons were less preferred at the older level (5 out of 8 or 62 percent). 
Programmes of adult violence were a steady attraction for both groups (9 out 
of 14 or 77 percent), although fewer in number than the comedies. As might 
be expected at these ages, the “serious” programmes (news and 
documentaries) exercised little appeal. One inference to be drawn from this is 
that the shows with greatest appeal will be mixtures of situation comedy 
(laced with sex) and violence, “Starsky and Hutch” and “Police Woman” 
being obvious examples. A later item analysis, not reported here, 
corroborated this prediction. 


TV Watching and Educational Attainment 


To examine the supposed influence of TV watching on verbal and 
mathematical attainment, two procedures were adopted. First, in each of the 
two grade groups, youngsters were divided into Low (< 27), Medium (27-34) 
and High (>34) groups, based on the number of programmes watched. 
Analyses of variance were run on the three attainment variables and the 
results are shown in Table 4. In this analysis, ‘““number of TV shows viewed” 


TABLE 4 


ANALYSIS OF VARIANCE OF AVERAGE VALUES ON ATTAINMENT 
VARIABLES BY LEVELS OF TOTAL TV VIEWING 


TV Group 


Variable Grade F Ratio Probability 
Low Medium High 

Mathematics III-IV 553) 74.9 74.6 Pyle sO 
V-VI 83.7 81.6 S2ai, April .80 

Word III-IV Ile Ss bye) 5)7/ sO) 3.00 -054 

Meaning V-VIL 69.9 63.8 60.2 PAEU/S) sO/ 

Paragraph III-IV Wika) 60.7 59.0 25 AO) 

Comprehension V-VI 70.6 64.6 66.1 1.19 ols 
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appears to have only a marginal negative relationship with achievement, the 
strongest being with vocabulary. 

In the second procedure, the data were reanalyzed by blocking separately 
on verbal IQ (VIQ) and nonverbal IQ (NVIQ). Since the results were very 
similar, only those for VIQ are shown. The objective of the analysis was to 
see if the previous results were common to all IQ groups. Each of the two 
grade groups was divided into three VIQ groups: Low (<110), Medium 
(111-130), and High (>180). The frequencies are shown in Table 5. 


TABLE 5 
NUMBER OF STUDENTS IN IQ AND TV WATCHING GROUPS 


TV Groups IQ Groups 


<110 ISL e) >130 


Grades III,IV 


<27 Shows 8 18 8 34 

27-34 Shows 1a 15 8 34 

>34 Shows iL 20 3 34 
30 53 19 

Grades V, VI 

<27 Shows Df 24 10 61 

27-34 Shows 28 30 if 65 

>34 Shows 27 27), 8 62 
82 81 25) 


Chi square tests carried out on the data shown in Table 5 indicated that 
the proportion of students in the TV group for the various VIQ groups were 
not significantly different in either grades III, IV (X 2 = 3.9, p >.05) or grades 
V, VI (x2 = 1.1, p>.05). This suggests that the number of shows watched is 
relatively independent of VIQ level. 

There were no significant interactions between VIQ and TV group in 
either of the two grade groups, for any of the dependent variables 
(mathematics, word meaning, and paragraph comprehension). Of course the 
VIQ main effects were significant, but that was to be expected. Only Word 
Meaning showed a significant TV group effect at the grade III, IV level. 
There were no significant TV group effects at the grade V, VI level. The group 
means are shown in Tables 6 and 7. 


TV Watching, Behavioural Impulsivity and Social Class 

Teachers’ estimates of behavioural impulsivity of the younger children on 
Davids’ Scale* correlated significantly with extent of viewing violent 
programmes by the younger children (0.292), and also with all measures of 
TV viewing by the older children (0.350 for total viewing). This is consistent 
with the view that commercial TV, as a marketing device, promotes impulse 
buying and, in its programmes, drives away boredom by constant change 
(Schrank, 1976, p. 27). 
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TABLE 6 
ACHIEVEMENT MEANS FOR GRADES III, IV FOR VIQ AND TV GROUPS 


Verbal 1Q TV Group Mathematics anor PaueeneDe 
Meaning Comprehension 
Low Low o HOGS) 394 43.9 
Medium OB 33.4 Oa S 
High Sie) SoS) 36.8 
Medium Low Sia) Mocs) WSo5 
Medium 80.3 64.2 69sL 
High S2ia2 63.9 66.2 
High Low 94.1 88.8 Sh) ot 
Medium 80.6 86.9 86.6 
High 86-0 Silks 3} 92.0 
TABLE 7 


ACHIEVEMENT MEANS FOR GRADES V, VI FOR VIQ AND TV GROUPS 


Verbal IQ TV Group Mathematics Mord Bowsexeph 
Meaning Comprehension 
Low Low 74.9 BG} a8) On 7 
Medium 74.4 ByA Al DS 4 
High isos: 47.5 ys)..3) 
Medium Low 89.3 ile 80.3 
Medium 2553 67.6 O.s! 
High 87.8 66.8 Hes 
High Low Bhs) 89° 1 sey aal 
Medium 3358) 87.0 98.4 
High 8)3}.(0) 80.9 80.3 
TABLE 8 


ANALYSIS OF VARIANCE OF TV VIEWING BY SOCIOECONOMIC 
STATUS (GRADES V AND VI) 


TV Groups N Mean SES SD 
Hi 61 G08, Wok 
M 65 Heyl a0) F = 2.96, p = .054 
Lo 62 Sholay) SS) 


The relationship, which came close to significance in the performance of 
the older groups, between TV viewing and the present measure of 
socioeconomic status is presented in Table 8. In a two-way ANOVA with the 
verbal IQ, the F-ratio was increased to 4.13 and the probability reduced to 
.018. This casts doubt on the original assumption that heavy TV viewing 
would characterize lower SES groups. It should be remembered, however, 
that the present SES measure is based on educational background. 
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Discussion 


Limited conclusions can be drawn from an initial investigation in this 
area. Further studies of this sort are required at this and later age-levels; the 
amount of time spent watching must be compared to the shows seen as an 
index of TV viewing, and interviews conducted to determine the common 
features of the heavy viewer, especially the bright but bored (and possible 
impulsive) student. Again, the same tests should be given to the counterparts 
of members of this group, i.e., those whose school achievement is relatively 
poor. A replication of the present finding with such a group would support the 
tentative conclusion that the general assumption about the unpleasant effect 
of heavy TV watching may not be accurate. 


Notes 


1. It is possible to contend that this relationship between low-income youngsters and TV viewing is artificial and 
that the factors which cause this poor educational achievement are the lack of books, cultural events and 
parents who reward achievement. However, in this investigation, the findings do not support Liebert and 
Schwartzberg. 


2. The questionnaire contained a list of 44 TV programmes, and the following instructions were given to the 
child: “Place a mark (y ) next to every TV programme which you usually watch (when it is on) because you like 
ab. 

3. Davids’ Scale contains 7 traits isolated by clinicians as being associated with hyperactivity. The rater 
assigns a value of between 1 and 6 to each trait, 1 indicating a low occurrence of this behaviour in the child 
and 6 a high occurrence. The rated behaviours include hyperactivity, impulsiveness, irritability, and so on. 
The maximum score for all seven items is 42. 
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Attitudes Toward Mathematics and 
Mathematics Instruction of Prospective 
Elementary Teachers 


Three Likert-type scales were used to investigate the attitude toward 
mathematics and beliefs about mathematics and mathematics instruction of 
prospective elementary teachers in three stages of preparation at the 
University of Victoria. Results indicate that students who have completed a 
content and a methods course in the mathematics preparatory program have 
more informal beliefs about mathematics and how it should be taught than 
students beginning the content course; t.e., they have a greater appreciation 
of the creative side of mathematics and stronger beliefs in the importance of 
discovery in learning mathematics. High achievers had more positive 
attitudes and more informal beliefs than low achievers. There were no 
differences between scale scores of students selecting primary and 
intermediate preparatory programs. (Dr. Vance is an Associate Professor in 
the Faculty of Education at the University of Victoria.) 


A major objective in the preparation of elementary teachers of 
mathematics is the development in these prospective teachers of certain 
attitudes toward and beliefs about mathematics and how it is learned. It is 
assumed that teachers’ attitudes towards mathematics will have a bearing 
on the way in which the subject is taught and on how well it is taught. A 
study by Phillips (1973) revealed that students who had had teachers with 
favourable attitudes towards mathematics in each of the three previous years 
had more favourable attitudes towards and higher achievement in arithmetic 
than students of teachers with unfavourable attitudes. 


In an investigation of the attitudes of prospective teachers toward 
arithmetic, Dutton (1951) obtained written statements from 211 students in 
elementary methods classes. He reported that “deep-seated, highly 
emotionalized attitudes that had persisted since childhood” were prominent 
in the thinking of the prospective teachers as they progressed through the 
teacher education program. Dutton constructed a scale to measure attitudes 
toward mathematics. When comparing responses of a group of prospective 
teachers who completed the scale in 1954 and another group who responded 
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in 1962, he found that the attitudes of the two groups were almost identical 
(Dutton, 1962). Smith (1964), however, found that his sample was more 
favourably inclined toward arithmetic than Dutton’s 1954 group. In a study 
of 287 students who had passed a content course and had completed the 
Dutton scale prior to and upon completion of the course, Todd (1966) found a 
significant gain in mean scores. Similar favourable results were reported by 
Reys and Delon (1968) in a study of students enrolled in one of three courses 
in a mathematics preparatory program, and by Hunkler and Quast (1972) 
who investigated the effects on student attitude of a content-methods course. 


In recent years, increased emphasis has been placed on the informal and 
investigative nature of elementary school mathematics. Collier (1972) found 
that prospective teachers in later stages of an elementary education program, 
which included two content and one methods course in mathematics, had 
more informal views of mathematics and instruction in mathematics than 
students beginning their training. 

The purpose of the present study was to compare the attitudes toward 
mathematics and the beliefs about mathematics and mathematics 
instruction of prospective elementary teachers at three stages of preparation 
at the University of Victoria. The attitudes and beliefs of high and low 
achieving students, students enrolled in primary and intermediate sections 
of a methods course, and students enrolled in a methods course with and 
without a particular content course as a prerequisite were also investigated. 


Method 
Scales 

Three Likert-type scales were used, each consisting of 20 items — ten 
positive and ten negative — to which individuals respond according to a 
six-point interval from strongly agree to strongly disagree. The possible 
range of scores on each scale is 20 - 120. 

The Attitude Toward Mathematics Scale (ATMS) is a modification of the 
Dutton-Blum (1968) scale. A score of 70 indicates a neutral attitude; a high 
score indicates positive or favourable feelings about mathematics; a low 
score indicates negative feelings. 

The Belief About Mathematics Scale (BAMS) and the Belief About 
Mathematics Instruction Scale (BAMIS), constructed by Collier (1972), were 
designed to measure a formal-informal dimension of belief. A score of 70 on 
either scale is interpreted to represent a neutral point; scores greater than 70 
are in the “informal” direction; scores less than 70 are in the “formal” 
direction. A high score on the BAMS indicates an appreciation of the creative 
and investigative nature of mathematics, as opposed to a view that 
mathematics is a closed, rigid subject. A high score on the BAMIS suggests a 
strong belief in the importance of student exploration and discovery, as 
opposed to teacher explanation, in learning mathematics. 


Experimental Design 
The scales were administered to various groups of prospective elementary 
teachers enrolled at the University of Victoria in the 1973-74 and 1974-75 


academic years. 


165 


J. H. Vance 


Students in the elementary education program are required to complete 3 
units (two semesters) of approved mathematics and a 1'%-unit mathematics 
methods course. Students not selecting mathematics as a teaching major 
generally fulfill the content requirement by taking Mathematics 160, 
“Fundamental Aspects of Mathematics for Elementary Teachers.” This 
course is taught by instructors from both the Mathematics Department and 
the Faculty of Education. The methods course, Education 744, “Curriculum 
and Instruction in Elementary Mathematics,” is taken by students in the 
“professional” or “diploma” year of their program. Successful completion of 
this year leads to initial certification as a teacher. A student selects a primary 
(K-3) or intermediate (4-7) section of the methods course. 

Three stages of preparation were defined as follows: 


I. Prior to taking Mathematics 160. The scales were administered to 
students enrolled in two sections taught by Faculty of Education instructors, 
during the first class in September, 1974. 

II. Upon completion of Mathematics 160. The scales were administered to 
students enrolled in three sections taught by Faculty of Education 
instructors, near the end of the course in April 1974. The scales were also 
administered in April, 1975, to the two sections who had completed the scales 
at the commencement of the course the previous September. 

III. Upon completion of Education 744. The scales were administered to 
students enrolled in three sections of the course in April, 1974, and in seven 
sections in April, 1975. 

Achievement levels were defined for each stage as follows: 

I. A high achiever had successfully completed both Math 11 and Math 12 
in high school. A low achiever had completed Math 11 but not Math 12, 
which is not required for high school graduation nor for admission to the 
Faculty of Education. 

II. A high achiever had completed Math 12 and obtained a grade of B or 
higher in Math 160. A low achiever had completed only Math 11 and 
obtained a passing grade in Math 160 of B- or lower. 

III. A high achiever had completed Math 12 and obtained a grade of B or 
higher in both Math 160 and Education 744. A low achiever had completed 
only Math 11 and obtained passing grades of B- or lower in both university 
courses. 

A 3 x 2 (preparation by achievement) design was used to compare scale 
scores of independent groups at the three stages and two achievement levels. 


TABLE 1 


CELL FREQUENCIES BY STAGE OF PREPARATION AND 
ACHIEVEMENT LEVEL 


Achievement Level Stage of Preparation 
it iat a ause All 
High 21 28 34 83 
Low 42 25 28 25 
All 63 513) 62 178 
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Table 1 gives the number of subjects in each cell. At each stage, only those 
students who had responded to the scales for the first time at that stage were 
included in the sample. 

In order that longitudinal comparisons could be made, some students were 
tested at two different stages. Data were obtained for 46 subjects who 
completed the scales prior to and upon completion of Mathematics 160 in 
1974-75, and for 27 subjects who completed the scales upon completion of 
Mathematics 160 in 1973-74 and upon completion of Education 744 in 
1974-75. Pre- and post-test scores were compared using the correlated t-test. 

Education 744 students who completed the scales near the end of the 
course in 1974 or 1975 were partitioned into two groups, first according to 
section chosen, primary (K-3) or intermediate (4-7), and then according to the 
content course taken to fulfill the mathematics requirement, Mathematics 
160 or other. In each case mean scores were compared using the t-test. 


Results 
Three Stages and Achievement Level Comparisons 
The mean scores for the three scales for groups at the three stages of 
preparation and two achievement levels are reported in Table 2. 


A two-way ANOVA—least squares solution for unequal cell totals (Winer, 
1962)—was performed for each scale to test for differences in the scores of 
students at the three stages of preparation and the two levels of achievement. 
The summary for these analyses is given in Table 3. 


Because the F ratio for the preparation variable was significant for the 


TABLE 2 


MEAN ATMS, BAMS, AND BAMIS SCORES BY STAGE OF 
PREPARATION AND ACHIEVEMENT 


Achievement Level Stage of Preparation 

it let TEL All 
High SMe) Dans DSi )8h5. 3) 
ATMS Low 82.8 79.8 eyed 83.3 
All tsi7/ 63) 87.0 JOR 88.8 
High TiSte2 74.8 1639 ea!) 

BAMS Low 6979 69.6 Aska) EP 
All HLS WERE 76009 135 
High Do 81.0 90.3 Bia 
BAMIS Low 68.2 LO 84.0 76.4 
All We 79.0 fey/ Ad sles" 


Note. Overall sd for ATMS = 15.3, BAMS = 11.4, BAMIS = 14.0. 
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TABLE 3 
ANOVA FOR PREPARATION AND ACHIEVEMENT 


Source df MS F Pp 
Achievement (A) ik 4166 19.98 .00 
Preparation (P) ; il 186 89 asl 
Se een 2 217 1.04 35 
Within Ly/P 209 
Achievement (A) iL 308 2.48 aly 
Preparation (P) il 498 4.01 OZ 
pig Monae ty 2 96 7 03 
Within ii 2. 124 
Achievement (A) 1 1440 10.03 .00 
Preparation (P) dl 3689 255, 10) .00 
BAMES pep A 2 35 Dai .79 
Within 172 144 


two belief scales, the Scheffé test was used to compare means at the three 
stages. For the BAMS, differences between scores at Stage I and Stage III 
were found to be significant, p<.05. For the BAMIS, differences between 
scores at Stage I and Stage II and between scores at Stage II and Stage III 
were significant, p<.05. As indicated in Table 3, the F ratios for the 
achievement variable were significant for the ATMS and the BAMIS, but not 
for the BAMS. 


Test-Retest Comparisons 

Scores of the groups of students who were tested at two different stages 
were analyzed using correlated t-tests. Mean scores and standard deviations 
at each stage and t-test results are presented in Table 4. 


TABLE 4 
MEANS AND STANDARD DEVIATIONS FOR THE ATMS, BAMS, AND 
BAMIS AT TWO STAGES OF PREPARATION AND T-TESTS FOR 
DIFFERENCES BETWEEN CORRELATED MEANS 


Stage of Stage of 

Preparation 16 2) Preparation t P 
(n = 46) Cb 0) 
IL fae iE IIIL 
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Section Type and Content Course Comparisons 

Means, standard deviations, and t-test results comparing the students 
enrolled in Education 744, when classified according to type of section and to 
background content course, are presented in Table 5. 


TABLE 5 


MEANS, STANDARD DEVIATIONS, AND T-TEST RESULTS FOR SCHOOL 


LEVEL AND CONTENT COURSE 


Section Content Course 
Inter- t p e L P 

Primary mediate Math 160 Other 

(n=114) (n=84) (n=130) (n=64) 


i 


Conclusions 


Analysis of data from the independent groups at the three stages of 
preparation leads to the following conclusions: 


(a) 


(b) 


(c) 


(d) 


Students begin Mathematics 160, the first course in the mathematics 
program for elementary teachers, with fairly positive attitudes 
toward mathematics. The attitudes of students who have completed 
this content course and of students who have completed both the 
content course and the methods course, Education 744, do not differ 
significantly from those of students entering the program. 

Students begin the content course with neutral beliefs about 
mathematics along a formal-informal dimension. About the same 
views are held by students who have completed the content course, 
but students who have completed both the content and the methods 
course have a more informal view of mathematics than those 
beginning the first course. 

Students begin the content course with neutral beliefs about 
mathematics instruction. Students who have completed the content 
course have a more informal view than those beginning the course; 
those who have completed both the content and methods courses 
have a more informal view than students who are beginning or who 
have completed the content course. 

High achieving students in the elementary mathematics program 
have more favourable attitudes toward mathematics and a more 
informal view of mathematics instruction than low achievers. The 
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two groups have about the same beliefs about mathematics along a 
formal-informal dimension. 
2. Analysis of data from students who completed the scales at two 
successive stages of preparation leads to the following conclusions: 


(a) Upon successful completion of the mathematics content course for 
elementary teachers, students possess a more favourable attitude 
toward mathematics and a more informal view of mathematics than 
they had prior to beginning the course. Student beliefs about 
mathematics instruction are about the same at the beginning and 
end of the course. 


(b) Upon successful completion of the elementary mathematics methods 
course, students possess more favourable attitudes toward 
mathematics, and more informal beliefs about mathematics and 
mathematics instruction than they had upon completion of the 
content course for elementary teachers a year earlier. 

3. Analysis of data from students at the completion of the methods course 
leads to the following conclusions: 


(a) There is no difference in attitudes toward mathematics, beliefs about 
mathematics, or beliefs about mathematics instruction between 
students who select a primary (K-3) preparatory program and those 
who choose an intermediate (4-7) preparatory program. 

(b) Upon completion of the methods course, there is no difference in 
attitudes toward mathematics, beliefs about mathematics, or beliefs 
about mathematics instruction between students who fulfill the 
mathematics prerequisite to the course by taking Mathematics 160 
and those who take another course. 


Discussion 


The overall results of the study indicate that students who have completed 
the content and methods requirements in mathematics of the elementary 
education program at the University of Victoria have more favourable 
attitudes toward mathematics, a greater appreciation of the creative side of 
mathematics, and stronger beliefs in the importance of encouraging learners 
to discover ideas for themselves and develop alternate methods for solving 
problems than students entering the program. Inasmuch as the development 
of positive attitudes and certain beliefs about the nature of mathematics and 
how it should be taught to school children is an important goal of the 
mathematics education program, these findings are encouraging. A previous 
study in the mathematical competence of prospective elementary teachers at 
these same three stages of preparation at the University of Victoria (Vance, 
1973) indicated that student understanding of the number system and of 
arithmetic, as measured by tests from the Cooperative Mathematics Tests 
series, increased upon completion of Mathematics 160 and again upon 
completion of Education 744. The final level of performance on these two 
measures was considered satisfactory. Thus the mathematics education 
program appears to be effective, at least to some extent, in increasing 
mathematical competency of prospective teachers and in developing 
desirable attitudes toward mathematics. 
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It is perhaps not surprising that high achievers have more positive 
attitudes toward mathematics than low achievers. Keeping in mind the 
test-retest results at Stages II and III, an examination of Table 2 suggests 
that during the methods course, low achievers experience the greater gains in 
attitude scores. One possible explanation of the results regarding beliefs 
about mathematics and mathematics instruction is that a person who does 
well in mathematics is likely to be more aware of the creative nature of the 
subject and be more willing to encourage other learners to investigate and 
discover ideas for themselves than someone who feels insecure about 
mathematics. A teacher with a poor understanding of mathematics would 
likely feel more confident having pupils practice a demonstrated procedure 
than allowing them to explore freely, and perhaps ask questions on ideas 
unfamiliar to the teacher. 


Kane (1968) found that prospective teachers who have relatively 
unfavourable attitudes toward mathematics prefer the primary grades, while 
those with the most favourable attitudes prefer the intermediate grades. This 
study indicates that students choosing the primary preparatory program and 
those selecting the intermediate program have approximately the same 
attitudes and beliefs about mathematics upon completion of their training. 
No attempt was made in the study to investigate student attitudes at the time 
the choice of program was made. In the competency study (Vance, 1973), no 
differences in the mathematical performances of these two groups were 
found. 


Many students who are accepted into the “professional year” are 
transfers to the Faculty of Education, either from within the university or 
from another institution, usually a regional college. In addition, university 
graduates may qualify for certification as teachers by completing 
professional work in a “diploma” year. Most of these “transfer” and 
“diploma” students, and students choosing mathematics as a teaching 
major, fulfill the mathematics content requirement by taking a course other 
than Mathematics 160. The attitudes of such students prior to beginning the 
methods course were not examined, but it was found that, upon completion of 
the course, there was no difference in attitudes or beliefs about mathematics 
or mathematics instruction between these students and those who completed 
the mathematics course for elementary teachers at the University of Victoria. 
Similarly, in the competency study, no differences between the two groups 
were found on either the structure or the arithmetic test. Thus, the group of 
students who come into the methods course with a content prerequisite other 
than Mathematics 160 complete the course with about the same level of 
competence in mathematics as those who take Mathematics 160, and with 
similar attitudes about the subject and the way it should be taught in 
elementary school. 
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School District Size and the Deployment of 
Personnel Resources 


Deployment of personnel resources (salaries) are reported for 91 high school 
districts in the United States. The major finding is that educational functions 
received a smaller proportion of resources while overhead functions received 
a larger proportion of resources in large districts. Large school districts coped 
with the cost of large size by obtaining sufficient resources from the 
environment to support the disproportionately large overhead and service 
components. Implications for the evaluation and financing of school 
organizations are discussed. (Dr. Daft is an Assistant Professor in the School 
of Business at Queen’s University; Dr. Becker is at the University of 
Chicago.) 


Theorizing about the administrative costs associated with larger 
organization size has occupied economists (Knight, 1933; Stigler, 1966), 
sociologists (Blau, 1970), systems theorists (Boulding, 1953), and others. Most 
of these people agree that there are definite benefits to organization, but 
wonder whether, if beyond a certain size, the cost of administering or 
maintaining the organization begins to outweigh the benefits. 


A great deal of empirical research has been undertaken to provide 
information about the relationship between organization size and the size of 
the administrative component. A recent review, for example, identified 
twenty-four studies that related size of administrative component to total 
organization size (Freeman & Kronenfeld, 1973), and additional research has 
been reported (Coates & Updegraff, 1973; Freeman, 1973; Kasarda, 1974; 
Evers, Bohlen & Warren, 1976). The empirical literature is by no means 
unanimous, but the ratio of administrative to production employees has been 
found to decrease with organization size across a variety of organization and 
industry settings. The pattern of administrative costs typically observed led 
Blau (1972) to conclude: “Contrary to the stereotype of the proliferation of 
bureaucratic machinery in large organizations, the administrative 
apparatus is proportionately smaller in large than in small organizations” 


(p. 6). 


173 


R. L. Daft and S. W. Becker 


Similar findings characterize studies of school organizations. Hawley, 
Boland and Boland (1965) reported that the proportion of full-time university 
administrators decreased as faculty size increased. Large size was associated 
with a smaller administrative ratio in a study of urban school systems in 
Western Canada (Gill, 1967). Another study of Western Canadian school 
systems reported the same cross-sectional finding, although no decrease in 
administrative ratios within districts was observed over a five-year period 
(Holdaway & Blowers, 1971). Studies by Lindenfeld (1961), Bidwell and 
Kasarda (1975), and Hendershot and James (1972) all reported a smaller 
proportion of administrators in larger districts in the United States. The only 
finding of a larger administrative ratio in large districts was reported by 
Terrien and Mills (1955) for California school systems. 


Two other types of research bear on the issue of administrative costs. In 
the one case, Rushing (1967), James (1972), and Kasarda (1974) all found that 
the ratio of clerical employees to production employees increased as 
organization size increased. The implication of these findings is that true 
administrative economies may not exist. Instead, as organization size 
increases, personnel simply may be redeployed from one part of the 
administrative function (top administrators) to another (clerical). The net 
effect of this personnel reallocation may or may not constitute 
administrative efficiency for the organization. 


The other type of finding concerns the proportion of professional staff 
specialists. Typically, professional staff constitute a larger proportion of 
personnel in large organizations (Kasarda, 1973; Bidwell & Kasarda, 1975; 
Holdaway & Blowers, 1971). This may reflect a greater division of labour in 
large organizations or a greater need for staff specialists when the 
organization serves a large population. 


Thus, the issue of proportional administrative costs and personnel 
deployment is complex. Multiple functions compete for resources within 
organizations, and some functions, such as administration and clerical, may 
partially substitute for one another. To properly understand the relationships 
among size, personnel deployment and administrative costs, all functions, 
line and staff, must be considered. 


The study reported here was undertaken to assess the relationship 
between size and the deployment of personnel resources in school 
organizations. All functions in the district, including teaching, 
administrative activities, and staff specialists, are included. Moreover, the 
study deals with salaries associated with each functional activity. Salaries 
represent the true cost to the district of providing administrators, clerks or 
teachers.! The analysis thus provides insight into the relative dollar cost of 
functions across districts of varying sizes. 


Size and Deployment 


The basic premise is that large districts are intrinsically different from 
small ones, that both have different personnel requirements even when they 
have similar goals and technologies. Small districts are expected to have a 
structure that is relatively simple. Supervision and control is typically 
exercised on a personal basis in small organizations. In addition, small 
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communities may not contain the diverse educational needs found in larger 
population centres. Hence small districts probably will not be faced with the 
diversity of resource demands faced by large districts. Large districts, like 
other large organizations, are expected to be bureaucratized and complex 
(Hall, 1972). Pugh, Hickson, Hinings and Turner (1969) found that large 
organizations employ greater use of rules and procedures. Meyer (1968) 
reported that large organizations also tend to have larger spans of control 
and a larger number of both vertical levels and horizontal divisions. 


The complexity and bureaucratic characteristics of large districts can be 
expected to have two influences on resource allocation. First, rules, 
procedures, and other formal communications are important sources of 
coordination and control. Consequently those groups responsible for the 
formal paperwork systems will have to be proportionately bigger and receive 
proportionately more resources in large organizations. Evidence of the 
greater requirement for formalized communications in large organizations is 
the positive relationship between the clerical ratio and size, reported by 
Rushing (1967), James (1972) and Kasarda (1974). 


Second, large district size can have the opposite relationship with 
administrative personnel. It has already been noted that administrators 
frequently comprise a smaller proportion of personnel in large school 
organizations (Hawley et al., 1965; Holdaway & Blowers, 1971; Bidwell & 
Kasarda, 1975). The explanation for smaller administrative ratios is that 
large organizations utilize rules, larger spans of control, impersonal 
communication, and other devices in place of personal supervision. In large 
organizations the administrative savings can be allocated to other 
administrative functions, such as clerical, or to the teaching core which 
provides services to the public. 


Large districts will also differ from small districts with regard to 
specialized educational services. Large organizations experience a greater 
division of labour; tasks are subdivided and specialists are hired to do these 
tasks. Large districts typically function in large, heterogeneous communities, 
and thus tend to experience demand for a greater range of educational 
services (Holdaway & Blowers, 1971). Moreover, large districts can afford the 
cost of new specialists more easily than small districts. A $15,000 salary 
takes a smaller proportion of the total budget in a large district. When the 
need for a new educational service is recognized, a large district can respond 
with a new service or department. 


The other major functions in school organizations are maintenance and 
teaching, but the deployment of resources to these functions has not been 
widely studied. Child (1973) concluded that the maintenance component 
increased at about the same rate as organization size. The maintenance 
component in school districts is expected to increase at least as fast as 
district size. The larger the district the greater will be physical plant size, 
geographical dispersion, and diversity of equipment and facilities. The 
maintenance support function will have to grow to support the increase in 
physical size and physical complexity of the district. Problems of 
transportation, sophisticated equipment, and large plant may even require 
the acquisition of maintenance personnel at a rate faster than the increase in 


total district size. 


175 


R. L. Daft and S. W. Becker 


There is not a great deal of evidence in the literature on the proportion of 
resources allocated to teaching component of schools, or to the technical core 
(Thompson, 1967) of other organizations. Most studies have been concerned 
only with administrative or clerical personnel, or perhaps staff specialists. 
Holdaway (1971b) reported that in Western Canadian school districts, the 
percentage of salaries going to instruction decreased as district size 
increased. Holdaway (1971a) also found that the proportion of instructional 
salaries was higher in metropolitan areas organized into multiple systems 
rather than into a single large school system. Large districts appear to face 
diverse needs from both within (bureaucratization) and without (specialized 
services), so the classroom teaching core of large districts will probably 
receive a reduced share of resources. Other functions will compete for and 
receive a greater share of resources in large districts. If this happens, large 
districts may have to ration teaching services (fewer teaching resources per 
pupil), or obtain greater per student revenues from the environment to 
support the diverse internal activities. 


In summary, administrators were hypothesized to take a smaller 
proportion of resources in large districts, but clerical personnel, staff 
specialists, and perhaps maintenance personnel were expected to require a 
larger proportion of resources. The net effect would be a smaller share of 
resources for teachers unless sufficient revenues are obtained to support the 
non-teaching functions. 


Method and Procedures 


Ninety-one high school districts in the mid-western United States during 
the 1970-71 school year constituted the sample for this study. The districts 
were located in the same state, and the sample included all districts with 
more than 500 students that were organized as four-year high school districts 
(grades 9-12). Neither elementary (grades K-8) nor mixed (K-12) districts 
were included. Four year high school districts have wide variation in size 
with the sample ranging from 535 to 16,035 students. 


The district sample was selected to be fairly homogeneous. One advantage 
of this approach allows for partial control for the effects of variables other 
than size that might influence personnel deployment. For example, the 
districts function under the same state’s rules and regulations concerning 
educational services, financing procedures, and district operations. The 
districts also have a similar purpose (grades 9-12 education) and deal with 
similar educational technologies. 


The disadvantage of this sample is the limited generalizability of the 
results. A single type of district was selected and all districts in that category 
were included. The sample could even be considered a “population” because 
there is not a larger universe with the same characteristics from which the 
sample was drawn. To some extent the findings may generalize to the same 
districts in other years, or to four-year districts in states or provinces with a 
similar legal and financial environment. In general, however, the findings 
should not necessarily be interpreted as representative of other district 
populations. 


The primary function of high schools is classroom teaching, so resources 
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devoted to that activity were allocated to the technical core. School districts 
also provide a range of other services including lunch programs, adult 
education, guidance counselling, speech therapy, athletics, health, and 
special education. Some of these services closely support classroom teaching 
while others do not, so they were divided into two groups: (1) auxiliary 
educational services that support classroom teaching, and (2) other student 
and community services. Other services tend to broaden the base of 
education to include non-classroom activities and to provide services to 
adults and others in the community. 


Each school district files an annual financial statement with the state 
superintendent of public instruction. The salaries expended for each major 
function in the district were obtained from the financial statements. 
Teachers and other district participants occasionally worked in more than 
one activity, so salaries were prorated according to the time spent on each 
function. The variables used in the analysis and the salary categories 
included in each variable are listed below. The latter three variables measure 
sources of revenue to test how large and small districts obtain revenues to 
handle their personnel requirements. The data are for the 1970-71 school 
year. 


Administration. Salaries for all individuals involved in _ the 
administrative function: superintendents, principals, their assistants, 
supervisors, coordinators, business managers, and department heads 
(ADMIN). 


Classroom Teachers. Salaries for teaching services rendered to students 
(TEACH). 

Auxiliary Educational Services. Salaries for personnel who provide 
services to students in support of classroom education. This category 
includes guidance and psychological counsellors, librarians, health, teacher 
aids, and audio-visual personnel (AUXED). 

Other Student and Community Services. Salaries for personnel involved 
in other student and community programs which include adult education, 
summer school, lunch, athletics, manpower development and training, and 
economic opportunity project (SERV). 

Clerical Support. Salaries for secretaries, clerks, and personnel who 
monitor attendance and textbooks (CLER). 

Operations and Maintenance. Salaries for personnel involved in the 
operation and maintenance of buildings, equipment, grounds, and 
transportation (MAINT). 

Community Tax Revenues. Income from local school district property 
taxes (COMM). 

State Flat Grant. State funds provided to districts on a per pupil basis 
(STATE). 

Other Programs. Special programs supported by state and federal 
governments (bilingual education, special education, driver education) 
(PROGS). 

The data were analyzed by computing the elasticity coefficient of the 
salaries paid in each functional component with respect to total salaries paid 
in the district. The elasticity coefficient is the percentage difference in 
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component salaries associated with a one percent difference in size (total 
salaries) across the districts in the sample. An elasticity of 0.8, for example, 
would mean the component salaries increased only 80 percent as fast as total 
salaries, hence the component received a smaller proportion of resources in 
large districts. An elasticity of 1.2 would mean the functional component 
received a larger proportion of resources in large districts. An elasticity of 1.0 
would mean that component salaries changed at the same rate and received 
the same proportion of resources across all districts. The elasticity of each 
component shows whether personnel resources are deployed in a different 
fashion as districts vary in size. 


The technique used to identify elasticities is to compute the simple linear 
regression of the form: 


Log Y = Log a+ b Log X (1) 


where Y = the salaries deployed to some district component, X = total salaries 
paid in the district, and Log a is the Log Y intercept. The regression 
coefficient “b” is the elasticity of the dependent variable with respect to 
changes in the independent variable (Rubenfeld, 1971). The “‘b” coefficient 
also indicates the curvilinearity of Y with respect to X in the raw data. When 
b = 1 for example, the relationship is linear (straight line), indicating that 
both variables increase at the same rate. When b >1 a regression line fitted to 
the raw data would have a slight upward curve, and when b<1 a regression 
line will have a slight downward curve. Estimating elasticities with equation 
(1) has been recommended for assessing the proportional size of 
organizational components (Freeman & Kronenfeld, 1973), and has been so 
used by Coates and Updegraff (1973), Ackers and Campbell (1970), and 
Pondy (1969). Elasticity analysis is widely used by economists (Johnston, 
1972; Chiang, 1967), and has also been used in studies of educational 
expenditures (Sharples, 1974). 


A specific hypothesis was tested for each personnel component. The 
common null hypothesis (Ho) was that the size of each component changed 
at the same rate as district size. The alternative hypothesis (H:) for 
maintenance, clerical, other student and community services, and auxiliary 
educational services was that component size changed faster than district 
size (the component received a larger share of personnel resources in larger 
districts). The alternative hypothesis (H,) for administration and teaching 
was that component size changed more slowly than district size (the 
component received a smaller share of resources in larger districts). 


If b = 1.0 for a component, the null hypothesis is supported. The 
alternative hypotheses were tested by computing the t-value for the 
difference between 1.0 and the observed b coefficient (t =[1.0 - b]/[s.d. of b]). 


The data used in this study are cross-sectional in nature, and the 
limitations of cross-sectional data should be emphasized. The relationship 
between school district size and personnel resource deployment will be 
assessed by comparing across districts. The findings will not demonstrate 
that changes in size over time within a single district will have the same 
outcomes. 
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Results 


The data in table 1 show how the size of each school district component 
changes with respect to district size. The administrative component received 
a smaller proportion of salaries in large districts. The size of the 
administrative component increased only 94 percent as much as district size 
across the districts in the sample. This finding is not strongly significant 
(p < .075), but is in the expected direction. 


TABLE 1 


PERCENTAGE CHANGE IN SUBUNIT SALARIES ASSOCIATED WITH 
CHANGES IN TOTAL SALARIES 


Subunit Elasticity PSL 
ADMIN 0.94 <.075 
ree 0.95 <.0001 
AUXED 1.22 <.055 
SERV 1.18 = 
CLER 1.27 <.0001 
MAINT hes <.0001 


N= 91. Statistical significance computed for difference between elasticity 
coefficient and 1.0. 


The district components which clearly increased their proportion of 
salaries are clerical and maintenance. The salaries allocated to clerical 
increased 28 percent faster than total salaries, and maintenance increased 15 
percent faster. These relationships are highly significant (p<.0001). The 
support requirements for written communications and physical plant 
apparently were much greater in large districts. These salaries can be 
considered as overhead because they don’t contribute directly to student 
education. These overhead salaries comprised a larger share of total salaries 
as district size increased. 

Nonteaching educational services also received a larger share of total 
salaries in larger districts. Salaries for auxiliary educational services 
increased 22 percent faster than total salaries (p<.055). Salaries for other 
services to students and community increased about 19 percent faster than 
total salaries (p<.03). These services appear to be proportionately more 
important in terms of relative size in large districts. 

The finding for the teaching core was as expected. Resources allocated to 
classroom teaching changed only 95 percent as fast as changes in total 
salaries across these districts. For each doubling of salaries across districts, 
salaries for classroom teaching changed by only 95 percent. The remaining 
resources were diverted to clerical, maintenance, and nonteaching 
educational services in the larger districts. Approximately two thirds (67.2 
percent) of the salaries in all of the districts went to classroom teaching. But 
over the range of districts in our sample, the smaller share of resources of 
classroom teaching could be a substantial amount: the largest district is 
approximately 30 times as large as the smallest district. 
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Table 2 provides some additional data on salary allocation. Components 
are combined to show the net efforts of changes in size. The combined 
salaries for administration and clerical changed 6 percent faster than total 
salaries. This means that savings in the administrative component were 
more than used up by proportionately larger clerical components in large 
districts. Adding maintenance to clerical and administration gives some 
indication of the change in total overhead costs as district size changes. 
Combined overhead salaries increased 10 percent faster than total salaries. 


TABLE 2 


PERCENTAGE CHANGE IN COMBINED SUBUNIT SALARIES 
ASSOCIATED WITH CHANGES IN TOTAL SALARIES 


Subunits Elasticity etyateieece 
ADMIN & CLER 1.06 525) 
ADMIN, CLER & MAINT 1h 6 IKO) <.0001 
TEACH & AUXED 0.97 <.0001 
TEACH, AUXED & SERV 0.975 <.0001 


N= 91. Statistical significance computed for difference between elasticity 
coefficient and 1.0 


The other two figures in Table 2 show what happens if the teaching core is 
defined differently. Many educators would argue that providing auxiliary 
services to students and other services to the community are critical 
functions for high schools to perform. When salaries for these other services 
were added to salaries for classroom teaching, the teaching core received a 
slightly larger proportion of resources than when teaching alone was 
considered (.975 in Table 2 vs. .95 in Table 1). But the core still received a 
smaller proportion of salaries in large districts. If these organizations are 
divided into two components — one including all educational services, the 
other including all overhead salaries —- the overhead component increases 
faster than the educational component. Thus the evidence supports the 
notion that large districts cost proportionately more to operate than small 
districts. There are apparently no administrative economies of scale for 
school districts of the sizes observed here when administration is defined to 
include all overhead salaries. 


Figure | translates the findings from Tables 1 and 2 into graph form. The 
lines indicate the change in personnel salaries for school districts of different 
sizes. Solid lines represent changes in specific functions; broken lines 
indicate the changes in total overhead and total educational salaries. The 
lines on the graph oversimplify the variance in the data, but the slopes 
indicate the trends across the 91 school districts. The percentage of salaries 
going to teachers dropped from over 70 percent in the small districts to about 
59 percent in the large districts. Total educational services (TEACH, 
AUXED, SERV) dropped from about 82 percent to 72 percent of total salaries. 
Total overhead costs (ADMIN, CLER, MAINT), on the other hand, increased 
from about 18 percent to 28 percent of salary costs. 
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Figure 1. Percentage of salaries allocated to each component as a function of 
district size. 
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If administrative economies do not exist in large districts, how can this 
problem be handled? Large districts either have to reduce teaching resources 
per student, or obtain additional resources. Additional resources would 
enable large districts to allocate salaries to overhead and nonteaching 
components and still provide comparable resources to classroom teaching. 


The amount and source of revenues in districts of varying sizes can be 
examined with the same kind of analysis. If teaching salaries per student are 
smaller in large districts, then the elasticity of teacher salaries with respect 
to students should be less than 1.0. These data are in Table 3. 


The elasticities in Table 3 are all greater than 1.0. This means that 
salaries paid in every component, including classroom teaching, are greater 
per pupil in large districts, the reason as explained by the last figure in 
Table 3 being that total salaries paid in the districts change almost 14 
percent faster than student enrollment. Large districts in this sample are 
apparently more successful in obtaining resources because they spend more 
salary dollars per pupil. The allocation of salaries within the districts is the 
same as before, of course, with teachers and administrators receiving less 
than the other components as size increases. But total resources to be divided 
among the components was proportionately greater in large districts. Large 
districts apparently supported disproportionately large overhead and 
auxiliary service components and still provided approximately 8 percent 
more salary dollars to classroom teachers. For this school district sample the 
notion can be rejected that rationing of teaching services will occur, i.e., that 
large districts will expend fewer teaching resources per student as a method 
of supporting the increasingly large administrative and maintenance 
apparatus. 


Some of the greater teaching salaries per pupil spent in large districts may 
be due to the following factors: (1) higher wage rates in urban areas, (2) 
higher ratios of teachers to pupils, (3) more highly educated teachers or (4) 
more experienced teachers. Thus, higher teacher salaries per pupil may 
reflect more teachers, higher wages, or more highly educated and experienced 
teachers. The data do not indicate exactly how the additional salaries per 


TABLE 3 


PERCENTAGE CHANGE IN SUBUNIT SALARIES ASSOCIATED WITH 
CHANGES IN STUDENT ENROLLMENT 


steers Elasticity tee, aa 
ADMIN tan <.075 
TEACH 1.08 <. 0006 
AUXED ua <.005 
SERV 1635 <.002 
CLER 1.45 <.0001 
MAINT Wye <.0001 
tee eR <.0001 


N= 91. Statistical significance computed for difference between elasticity 
coefficient and 1.0 
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TABLE 4 


PERCENTAGE CHANGE IN RESOURCE INPUTS ASSOCIATED WITH 
CHANGES IN STUDENT ENROLLMENT 


Resource Sector Elasticity Statistical 
Significance 
COMM Mes 9 <.0001 
STATE 0.99 ace) 
PROGS 1703 <2 


N= 91. Statistical significance computed for difference between elasticity 
coefficient and 1.0 


pupil are spent. But large districts are able to support larger overhead 
components and more educational specialists, and still deliver more dollars 
per pupil to the classroom. 


If teaching dollars are not restricted in large districts, and if overhead and 
auxiliary educational components are proportionately larger, who supports 
these activities? The relationship between origin of financial support and 
organization size is shown in Table 4. School district revenue in the U.S. 
comes from three sources: community taxes; state flat grants; and federal, 
state and local programs. Districts have little control over state flat grants, 
but can influence to some extent the income from other sources. 


The data in Table 4 show that large districts received greater financial 
support from the local community. Income from state grants and other 
programs increased about the same rate as student enrollment, but income 
from local community taxes increased about 19 percent faster than student 
enrollment. These data do not indicate whether this increase can be 
attributed to a greater tax base in larger communities or whether large 
districts simply extract a larger share of available community resources. 

The relationship between size and program funding is somewhat 
unexpected. Large school districts might be better equipped to obtain funds 
from state and federal programs to provide specialized classroom and 
community services. The third coefficient (1.03) in Table 4 indicates that this 
is not the case. Most program funding is a subsidy to encourage the school to 
adopt the program. Funds are there for the taking which, apparently, most 
schools do. The amount paid reflects the school’s size. Large districts do not 
appear to have an advantage in obtaining revenues to provide specialized 
educational functions, but they do obtain more revenues from the local 
community. 


Discussion and Conclusions 


The expectations concerning the allocation of resources within the school 
districts are generally supported. The salaries devoted to administration take 
a smaller percentage of total salaries in large school districts. One 
explanation is that larger districts are administered with the use of 
impersonal communication and control devices such as budgets, memos and 
written procedures. These impersonal methods may save administrators but 
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they require additional clerical services and personnel. The additional 
resources consumed by the clerical group more than offset the saving in 
administrators. The maintenance component was also proportionately larger 
in the large districts and added to the overhead costs. The net effect was that 
overhead functions required a larger percentage of personnel costs in larger 
districts. In large school districts a larger share of salary resources was also 
allocated to educational services other than teaching. This finding reflected 
the greater demand for diverse services to meet the educational needs of a 
large student body and a more heterogeneous community population. 


The deployment of personnel to overhead and nonteaching educational 
services led to the most important finding that a smaller percentage of salary 
resources were channelled into the teaching core in large school districts. 
Classroom teaching received the majority of resources in each district, but it 
was clearly a smaller proportion in large districts. 


The data also indicated that large districts were able to obtain sufficient 
resources from the local community to meet the requirements associated with 
large size. Lortie explains this by suggesting that large school districts are 
better able to coopt other people and organizations in their environments, 
and hence are able to secure the support necessary to obtain funding for the 
various programs.” 


The finding that large districts have proportionately greater overhead 
costs is consistent with Holdaway’s (197la, 1971b) studies of Canadian 
school systems. Holdaway found noninstructional costs higher in larger 
districts. Other studies of the administrative component typically examined 
only administrative personnel, or clerical personnel, or staff specialists. The 
total cost of the overhead machinery was understated in those studies 
because the combined cost of overhead functions was not tested. By 
examining resource deployment throughout the school organizations, the 
opposite result is found. Overhead costs are proportionately greater in large 
rather than small districts. 


The impact of size and revenues has implications for public policy 
decisions regarding the delivery of educational services. It is often proposed 
that districts be financed on a uniform per pupil basis (Guthrie et al., 1974). 
In this way inequities across districts in the provision of educational services 
supposedly will be removed. But the findings above suggested that uniform 
financing may not equalize educational services because large districts 
allocate proportionately more resources to overhead activities. Under 
uniform financing a smaller percentage of the educational dollars may reach 
the classroom in larger districts. Larger districts may have difficulty meeting 
requirements for overhead costs and for nonteaching services, and still 
provide comparable resources to classroom teaching. The problem of 
financing public services in a nearly equal manner affects many types of 
public organizations. New alternatives, such as uniform financing of 
technical core activities and unequal financing of support activities, may 
offer a better solution than uniform financing on a per capita basis. 


Another implication of the above findings concerns the evaluation of 
performance in schools. Typically outputs are difficult to define and measure. 
Hence there is often a tendency to evaluate school organizations on the basis 
of resource inputs such as the cost of personnel, equipment, or supplies. But 
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small districts may appear to be more efficient on the basis of certain 
financial cost ratios. Overhead costs will be proportionately higher in large 
districts, but this may be because of legitimate requirements for greater 
clerical and maintenance services. The performance evaluation of large and 
small districts should allow for different cost requirements. 


These findings also have implications for administrators within 
educational organizations. In large districts, administrators should be 
prepared to invest in overhead activities. The large organizations have 
greater requirements for paperwork, other written communications, and 
maintenance. These requirements appear to be legitimate, so administrators 
should not restrict the overhead functions because of the belief that only the 
teaching core does important work. Moreover, support activities may make 
teaching personnel more effective. Administrators should also be cautious 
about comparing their performance to other districts on the basis of cost 
ratios. Useful comparisons are most likely to be observed when districts are 
similar in size. 


In conclusion, there appear to be mixed blessings associated with large 
district size. The large districts in this sample provided more specialized 
services, and they were able to obtain sufficient revenues to meet a variety of 
organizational and educational requirements. But proportional costs of 
overhead personnel were greater. More research is clearly needed into the 
allocation and utilization of resources in school organizations. Many 
questions remain to be answered. If, for example, research can determine 
whether the proportionately greater overhead cost associated with large size 
is related to output, then investigators can say more about how school 
organizations should be organized. Perhaps school organizations would 
utilize resources more efficiently by delivering services with groups of small, 
specialized organizations rather than with large, consolidated ones. 


Notes 


1. Salary costs in a function may be influenced by local wage rates, employee qualifications, and the number of 
personnel in the function. Daft (1976), however, reported that the use of salary costs yielded the same results 
as number of personnel in organizational analysis, because number of personnel is by far the most important 
determinant of salary costs. 


iS) 


. Professor Dan C. Lortie, Department of Education, University of Chicago — personal communication. 
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Iconic and Immediate Memory in Elementary 
School Children 


The effects of five variables in Iconic and Immediate Memory in elementary 
school subjects (sex, age, grade, exposure time, and stimulus density) were 
studied using a simultaneous presentation of consonant letter stimulus items 
and immediate recall. The data were collected individually from 240 subjects 
(20 subjects for each age-grade combination in the elementary school) using 
tachistoscopically controlled exposures and oral recall. The results indicated 
that all variables, with the exception of the sex of the subject, were significant 
factors in Iconic and Immediate Memory. As age and grade increased, recall 
on all tasks increased significantly. The results of increasing exposure time 
and stimulus density on recall indicated that the younger subjects (up to eight 
years of age in grade three) tended to recall fewer letters whereas the older 
subjects tended to increase the number of letters recalled. The study suggests 
that subjects in grades three to six demonstrate a fully developed Iconic 
Memory capacity but only the subjects in grade six show a functionally 
developed Immediate Memory capacity for storage and recall of letters. This 
study, therefore, supports the basic concept of the stage-theory of the 
structure of memory. (Dr. Janzen is an Associate Professor in the Department 
of Educational Psychology; Mr. Ewert is a doctoral student in the same 
department.) 


Without a doubt, the primary activity required for school children (and 
everyone else, for that matter) is to process information, whether visually, 
aurally, or tactually. Theories from the time of William James’ (1890) to 
Miller’s (1956) to Sperling’s (1960) to Norman’s (1969) research clearly 
support the concept of memory on two dimensions, one being a short-term 
recall factor, the other the capacity to process information. Conceptual 
refinement by contemporary researchers (Kumar, 1971) and utilization of 
information processing theory suggested the concept of memory as an active 
participant in the processing of information. 


The present study utilized Keele’s (1973) concept of the basic structure of 
memory as comprising of: (1) Sensory Register (SR) or Iconic Memory (IcM), 
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(2) Short-Term Memory (STM) or Immediate Memory (IM), and (3) 
Long-Term Memory (LTM). Keele (1973) suggested that these stages can be 
distinguished from each other by time and space. The constructs of Iconic 
Memory and Immediate Memory, however, have been developed primarily 
with data from adult subjects. While elaborating the model, this has provided 
little information about the development of these constructs in children. 


In this paper, the conceptualization of the iconic memory system is 
assumed to involve central processing, it being recognized that in most 
memory systems, only peripheral processes may be operating (Turvey, 1973). 
In Turvey’s research, where the relevant features of recall included location, 
size, and color (retrieval cues with the partial report), one can see why he 
suggests that iconic memory consists of a series of peripheral visual systems. 
In the case where not the critical physical features, but the associative 
features (such as whether an item is a digit or letter; or recall of a series of 
letters) are measured, a more central system is used. It is this latter 
interpretation that this study follows (Baddeley, 1976). Whether peripheral or 
central, it should be stated that the systems probably work in parallel at 
different processing rates, depending on other factors such as exposure time 
and stimulus density. In the case where the information must be extracted 
from an iconic store and held for a short duration, it is probable that iconic 
memory does require complex cognitive processing. 


Iconic Memory 


The capacity of Iconic Memory is larger than the amount of information 
transferred into Immediate Memory (Sperling, 1960). At the Iconic level of 
processing the loss of information is primarily due to the rapid decay of the 
visual trace (Neisser, 1967); however, in the case of partiai reports (Sperling, 
1960; Averbach & Coriell, 1961), the loss of information also occurs due to the 
selection of specific items for recall. In Iconic Memory, the capacity 
established by the free recall of stimulus items briefly displayed indicates the 
recoding capacity from the rapidly decaying visual trace into verbal storage 
for recall (Sperling, 1960). This recoding capacity may more easily be 
conceptualized as the transfer capacity, the amount of information that can 
be transferred (recoded) from the rapidly decaying visual trace into verbal 
storage for oral recall. This transfer capacity of Iconic Memory is, in this 
study, of more interest than the absolute capacity, as established by 
estimations from the partial report recall method used by Sperling (1960) and 
Averbach and Coriell (1961). Although the actual size of the Iconic Memory 
capacity may be no different among subjects in elementary school, the 
amount of usable information that can be recoded from the absolute capacity 
may change as age and grade levels change. 


Although Haith (1971) reported a lack of significant age differences in 
visual processing speed for single geometric figures presented 
tachistoscopically, he did report significant differences in recall between 
younger and older subjects for presentations of multiple stimuli. Teichner 
and Sadler (1962) found that at brief exposure times (up to 500 milliseconds), 
item recall for adults was positively related to stimulus density. If, as Haith’s 
results indicate, increased stimulus density is negatively correlated with 
recall in younger subjects, the development of Iconic Memory may be best 
investigated by establishing the particular age-grade level at which recall in 


189 


G. D. Ewert and H. L. Janzen 


an Iconic Memory task is positively related to stimulus density. If, as the 
results reported by Teichner and Sadler (1962) and Haith (1971) suggest, the 
positive relationship between recall and stimulus density is a major 
distinguishing feature in comparing the recall of the mature with the 
younger subjects, one could therefore hypothesize that subjects whose recall 
increases as stimulus density increases, have a functionally matured Iconic 
Memory. 


Immediate Memory 


In Immediate Memory, the loss of information is primarily due to the lack 
of rehearsal of the items (Norman, 1969; Kumar, 1971). In contrast to the 
Short-Term Memory task, the Immediate Memory approach is not concerned 
with how many trials are required before a subject achieves perfect recall; 
rather, this method is concerned with the number of items a subject can recall 
after one simultaneous exposure of the items. Blankenship (1938) defined 
memory span as the ability of an individual to reproduce immediately, after 
one presentation, a series of discrete stimuli in their original order. This latter 
whole-report approach is the one the present study will be using. 


The finding that recall on Immediate Memory and Short-Term Memory 
tasks is positively related with age has been well established (Blankenship, 
1938; Dornbush & Basow, 1970; McCarver, 1972). These findings did not, 
however, attempt to distinguish between the age and educational effects of 
the development of Immediate Memory. Subjects of the same age level, but in 
different grade levels, may not perform similarly on Immediate Memory 
tasks. The difference in grade level, as well as age, may be reflected in a 
difference in recall on Immediate Memory tasks. 


Corsini (1971) states that the development of Immediate Memory is 
directly related to the ability to use mnemonic strategies. The problem in the 
present study was to demonstrate that the increased recall related to age 
increases (Blankenship, 1938; McCarver, 1972) may be due to the 
development of mnemonic strategies for recall in Immediate Memory. The 
most effective method to resolve this dilemma is the use of two stimulus 
densities, one with a relatively light information load of six letters, the other 
with a relatively high information load of twelve letters (which would be well 
beyond the capacity of any of the subjects to recall completely). The recall for 
the complex stimulus array of those subjects, with mnemonic strategies for 
processing complex stimulus arrays, should be significantly greater than the 
recall of those subjects lacking the necessary mnemonic strategies. The 
difference in recall may also be demonstrated between the subject’s recall for 
the two stimulus densities at the same exposure time as well as over a variety 
of exposure times. If the recall for the low density stimulus array is always 
greater than the recall for the high density arrays, one could conclude that 
the necessary mnemonic strategies for processing the high information 
arrays may not have been developed. Further, if the recall for the high or low 
density arrays did not increase, or if in fact, the amount recalled decreased as 
the exposure time increased, this result would seem most indicative of the 
lack of development of the necessary mnemonic strategies. Therefore, it may 
be postulated that the age-grade level, at which the subject’s recall of the 
stimulus items for both stimulus arrays increases as the exposure time 
increases, would indicate a functionally developed Immediate Memory. 
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Research Postulate 


Since previous research in Iconic and Immediate Memory using 
school-age subjects is lacking, the present study did not formulate specific 
hypotheses. Rather, the major postulate under investigation was to examine 
age-grade differences in recall for two different stimulus densities and 
varying exposure times in two stages of the memory model. For those 
subjects whose Iconic capacity is functionally mature (as measured by a 
consistently greater recall on the twelve-letter condition as compared to the 
six-letter condition), it is postulated that their recall on Immediate Memory 
tasks will be consistently greater than those subjects whose Iconic Memory is 
functionally immature (as measured by a consistently greater recall on the 
six-letter condition as compared to the twelve-letter condition). Analysis of 
results across age groups should indicate, therefore, the necessity of a 
functionally effective Iconic Memory as a prerequisite for the development of 
an effective Immediate Memory. If this is indeed the case, the descriptive 
model of memory processes (Keele, 1973) would also be applicable as a 
developmental model. 


Procedure 
Subjects 


Two hundred and forty school subjects participated in the study. Ten 
males and ten females for each age-grade combination were randomly 
selected and tested. The subjects were drawn from two urban elementary 
schools in Edmonton and one suburban elementary school in Sherwood 
Park. 


Apparatus 

A Polymetric Company Tachistoscope Model V-0959T (234 South Eight 
Street, Reading, Pennsylvania, U.S.A.) was used to present the visual 
stimulus cards. The visual stimulus cards were eighteen 3- by 5-inch plastic 
covered white cards: one card with twelve digits, eight cards with six 
consonant letters, and nine cards with twelve consonant letters. Cards with 
twelve letters or twelve numbers on them had a %4-inch space between the 
items. The cards with six letters on them had a '-inch space between the 
items. All letters were centered with a one-inch border on each side and a 
1-29/32 inch margin on the top and bottom of these cards. The consonants 
and numbers were 4- by 3/16-inch in size and were chosen from a Letrite 4 
printing set. The pre- and post-exposure field was a plastic covered white 
card. 


Method 

When the subject entered the testing room he was asked to sit on a chair 
placed in front of the tachistoscope. The subject was asked the grade level he 
was in and how he enjoyed school. Some informal conversation was always 
engaged in with the subject to allow time for some of the strangeness of the 
experimenters and the situation to dissipate before testing started. The 
subject’s name, grade, age, birth date, testing date, and sex were recorded on 
the data sheet. Two experimenters were always in attendance, one to record 
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the biographical and response data, the other to give the instructions and to 
administer the tasks. The order of testing followed a standard procedure 
which paralleled the hypothesized developmental order of Iconic Memory, 
and then Immediate Memory. This order of presenting the task was the most 
efficient as each task introduced the task that followed, giving the testing 
situation a natural progression from task to task rather than jumping from 
one task to another. 


The procedure and instructions will be presented under the appropriate 
sub-headings which will name the factor that was being investigated at that 
time. 


Iconic Memory. The subject was seated on a chair in front of the 
tachistoscope when the following instructions were given: 

I am going to flash something very quickly on the white screen at the back of this 
box (referring to the tachistoscope). I am not going to tell you what I will flash. I 
want you to tell me what you see. 

At this time the subject was positioned with his face fitting snugly into the 
face guard on the tachistoscope in order to keep out extraneous light. 

I want you to keep your face in this head guard as tightly as possible. I want you to 
look at the screen at the back. Just before I flash the things on the screen I will say 
“READY,” then I will flash them. Do not worry if you cannot see them right away, 
they are flashed so quickly that most people may not be able to see them right 
away. All right? “READY.” 

At this point the first exposure of stimulus items was given at the 200 
millisecond exposure level. Two exposures were given, one of letters and one 
of numbers. Whether the numbers or letters were presented first depended 
upon the order given to the preceding subject. If the preceding subject had 
been given the letters first, the following subject was exposed to the numbers 
first and so on, to avoid a bias towards which stimuli would be recognized 
more quickly than the other. If the subject did not recognize the items 
presented, alternate trials of letters and numbers were presented at each 100 
millisecond increment level until both sets of stimuli were recognized. Each 
unsuccessful exposure level was marked on the data sheet as an “x” and the 
successful trial was marked with a check-mark. The instructions attempted 
to reassure those subjects who would not immediately recognize the stimuli 
flashed and also attempted to ensure attention at the proper time by the use 
of a cue before the exposure. The subject was also praised for his performance 
in the following manner: 

You did very well on that part. You see that was not so hard was it? 


The exposure level at which the subject recognized letters as letters was 
used as the exposure level for the six trials presented in the Iconic Memory 
tasks. The subject was instructed: 

Now, this time I am going to flash more letters on the screen. They will flash just 
as quickly as they did before but this time I want you to tell me as many of the 
letters you saw as you can. I will say “READY” and then I will flash the letters. 


Remember, as soon as the letters flash tell me as many of the letters you saw as 
you can. All right? “READY.” 


The letters were in groupings of 6 and 12 letters per card, with three 
presentations of the six-item displays, and three presentations of the 
twelve-item displays. The order of presentation was either: 6, 12, 12, 6, 6, 12 or 
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12, 6, 6, 12, 12, 6 depending on the order used for the previous subject. The 
order used alternated from subject to subject in order to randomize out a 
recall bias due to seeing a 6-item versus a 12-item display first. After each 
exposure the letters recalled were recorded as well as the card number and the 
order of presentation. Those subjects who recalled the entire or the most part 
of the alphabet were cautioned to recall only those letters that they actually 
saw. A new stimulus card was then presented as the first trial. If the subjects 
persisted in randomly recalling strings of letters with no relationship to the 
stimulus items, they were at this point thanked for their participation in the 
study, returned to their classroom, and deleted from the study. With the 
subjects continuing in the study, testing continued until a total of three 
6-item and three 12-item displays were responded to. If a subject responded 
that he could not see the items, the particular card was set aside and shown 
again at the end of the original exposure order. Any card exposed more than 
once was noted on the data sheet. Since more subjects were tested than 
required for this study, those subjects having no recall and those requiring 
more than one extra exposure of stimulus cards were deleted from the study. 
After each exposure the subject’s recall was responded to by the experimenter 
in the following manner: 
That’s very good, try this one. 


At no time during testing were the subjects told whether the 6- or 12-item 
display was going to be shown. 


Immediate Memory. Upon completion of the six trials for Iconic Memory 
performance the subject was instructed: 
You did very well on that part, now this time I am going to show you more letters 
but this time you will have a much longer time to look at them. That will make it 
easier for you to remember them won’t it? (pause for the subject’s response). This 
time though, I want you to remember the letters from left to right in the same order 
that you see them (at this point the subject was shown the letter stimulus card 
used for establishing the Recognition Threshold and shown with the aid of this 
card what was required). Remember, when the letters disappear you tell me as 
many of the letters that you saw as you can, but please remember them in order. 
Do not repeat the letters out loud while they are on the screen, but as soon as they 
disappear tell me, in the correct order, as many of the letters as you can remember. 
The order of presentation was the opposite of the order used in testing 
Iconic Memory performance, either 12, 6, 6, 12, 12, 6 or 6, 12, 12, 6, 6, 12. The 
first pair of 6 cards were exposed for 3 seconds, the second pair for 6 seconds, 
and the final pair for 9 seconds. After the last 3- and 6-second exposure and 
prior to the first exposure at the next time level, the subject was instructed: 
This time you will have a longer time to look at the letters. Remember to tell me the 
letters in the order they were on the screen. 


After each attempt by the subjects to recall the items they were told: 
That’s very good, now try this one. 
Those subjects who persisted in recalling letters randomly or who could 
not recall any letters after any one of the exposures of stimulus items were 
deleted from the study. 


When testing was completed, if the subjects wished to examine the 
tachistoscope, they were shown how it worked. They were not given any 
information which could affect the performance of another subject to whom 
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they might talk. In general, the subjects who were the first from the 
classroom were slightly apprehensive and an attempt was made to make 
them feel comfortable. The younger subjects in particular were instructed 
after the testing was completed: 
You did very well on that. You see it was not hard at all, in fact it was rather fun, 
wasn’t it? (pause for subject’s response). Be sure to tell the rest of your class how 
much fun it was, but don’t tell them what we did because we want to surprise 
them. 

Some subjects with partial blindness, a recent illness, or subjects who had 
engaged in a vigorous physical exercise just prior to testing were deleted from 
the study. The testing session was designed to last approximately five 
minutes in order to ensure a high level of subject attention and performance 
during testing. The actual time taken varied from five to seven minutes, 
depending on the age of the subject. The younger subjects usually required 
additional repetitions of the instructions. 


Throughout all testing, the whole-report technique was utilized. 


Results 

Analysis 

A one-way analysis of variance with a Newman-Keuls comparison 
between ordered means was used to examine the between-group differences. 
A three-way analysis of variance was used to examine the effects within and 
between groups on the variables “stimulus density,” “exposure level,” and 
“sex’’ of subject. Since grade level was confounded with age, it was decided 
not to analyze grade as a separate factor, but concentrate primarily on age as 
the developmental variable. All programs used were supplied by the Division 
of Educational Research, University of Alberta, Edmonton, Canada. 


The results of 240 elementary school subjects were obtained for all Iconic 
Memory and Immediate Memory variables. Each grade level included two 
age groups whose average age difference for the total sample was 7.26 
months. Each age level, with the exception of the six- and twelve-year-olds, 
occurred in two grade groups. The average age difference between subjects of 
the same age but different grade level was 4.26 months. The subjects in the 
higher grade tended to be slightly older than their age-mates in the lower 
grade. In each group an equal number of male and female subjects were 
tested. 


Table 1 shows the means and F-ratios for all groups over all variables. All 
the F-ratios are significant at the .01 level or greater. Tables 2 to 9 present the 
results of the Newman-Keuls comparison of ordered means for each variable 
used in the study. 

Iconic Memory (IcMe, IcMi2). The number of letters correctly recalled 
tended to increase as age and grade increased. Tables 2 and 3 show the mean 
recall for the six- and twelve-letter displays respectively for each age-grade 
group with the stimulus exposure at each individual’s specific letter 
recognition exposure level. 


In Table 2 for the six-letter displays (IcMs), I-6 subjects had significantly 
lower recall than all other groups. This finding reveals a significant age 
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TABLE 1 


ONE-WAY ANALYSIS OF VARIANCE WITH GROUP MEANS 
AND F-RATIOS 


Grogs? Variables 
Ic.M/6 Ic.M/12 IM (3/6) IM (3/12) IM (6/6) IM (6/12) IM (9/6) IM (9/12) 

I-6 1h, AOS 1.2835 2.7000 2.9000 3.0000 2.3500 3.2500 2.6500 
1-7 1.6665 1.9670 3.3000 2.9000 2.8500 2.2500 3.1500 2.3000 
II-7 2.1830 2.0500 3.7000 3.2500 3.6500 2.6000 3.8000 3.0500 
11-8 2.0340 1.9340 2.8000 2.9500 3.4000 2.1500 3.2000 2.5000 
III-8 25 2.6995 4.3000 3.7000 4.8500 3.2500 4.6000 3.8000 
I1I-9 2.5505 3.0505 4.7090 4.9000 4.9500 4.1000 5.2000 4.5000 
Iv-9 2.6655 2.9500 4.4500 4.1000 4.6000 3.9500 5.0500 4.8500 
Iv-10 2.6670 2a71320 3.8500 3.9000 4.2500 3.3000 4.5500 3.7500 
v-10 2.2665 2.6995 4.3500 4.1500 4.8000 3.3000 5.4500 4.1500 
v-11 2.9325 3.1495 4.4000 3.9500 4.9000 4.2000 5.0500 4.0000 
VI-11 2.6830 2.8825 4.8500 4.4500 Bee 500 4.4500 5.4000 5.1500 
VI-12 2.9995 3.4160 4.8000 4.3000 5.2000 4.8500 5.0500 5.3000 
F-Ratio 10.87 10.74 8.84 4.28 10.45 10.80 13.83 13.70 


Note. An F-ratio of 3.66 is significant at the .01 level. 


9 N = 20 for each age-grade group, matched by sex. 


TABLE 2 


NEWMAN-KEULS COMPARISON OF ORDERED MEANS OF ICONIC 
MEMORY RECALL FOR SIX LETTERS 
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difference between I-7 subjects and I-6 subjects. However, III-8 subjects 
show significantly greater recall than II-8 subjects. 


In Table 3 for the twelve-letter displays (IcMi2), I-6 subjects had 
significantly lower recall than all other groups. All the older subjects (III-8 to 
VI-12) showed significantly greater recall for the twelve-letter displays than 
the younger subjects (I-6 to II-8). The trend for the younger subjects (I-6 to 
II-8) to recall fewer letters than the older subjects (III-8 to VI-12) is 
noticeable in the six-letter condition, but is only statistically significant for 
the twelve-letter condition. 

Immediate Memory (IM3-6, IM3-12, [Mo«, [Mo-12, IMo«, IMo-12). Tables 4 and 
5 show the mean recall of all groups for the six- and twelve-letter displays 
respectively at the three-second exposure level (IM3«, and IM3.;2). 


In Table 4 for the six letter displays, all older subjects (III-8 to VI-12) have 
significantly greater recall than subjects in I-6 and II-8. With the exception 
of IV-10 subjects, all older subjects (III-8 to VI-12) have significantly greater 
recall than I-7 subjects for the six-letter display. 


As can be seen in Table 7, for the twelve-letter display, with exception of 
groups III-8, IV-10 and V-10, all older groups have significantly greater 
recall than all younger subjects (I-6 to II-8). In addition, both age groups in 
grade six (VI-11 and VI-12) have significantly greater recall than the 
subjects in groups III-8, I-10 and V-10. 

Tables 8 and 9 show the mean recall for all groups for six- and 
twelve-letter displays respectively at the nine-second exposure level 
(IM 9 and IMo-12). This nine-second exposure level provides the most complete 
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TABLE 4 
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TABLE 6 
NEWMAN-KEULS COMPARISON OF ORDERED MEANS OF 
IMMEDIATE MEMORY RECALL FOR SIX LETTERS AT THE 
SIX-SECOND EXPOSURE LEVEL 
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TABLE 8 


NEWMAN-KEULS COMPARISON OF ORDERED MEANS OF 
IMMEDIATE MEMORY RECALL FOR SIX LETTERS AT THE 
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distinction between the younger subjects (I-6 to II-8) and the older subjects 
(III-8 to VI-12) for both the six- and twelve-letter displays. For the six-letter 
display, all older subjects (III-8 to VI-12) have significantly greater recall 
than all younger subjects (I-6 to II-8). (See Table 8.) From the results in 
Table 9, one can see that for the twelve-letter display, with the exception of 
groups III-8, IV-10 and V-11, all older subjects have significantly greater 
recall for twelve letters than all younger subjects (I-6 to II-8). 


The three-way analysis of variance used a 2 x 3 x 2 design with repeated 
measures on two variables to examine performance on Immediate Memory 
tasks. The repeated measures variables were stimulus density and exposure 
level. Table 10 contains a summary of the analysis. 

As can be seen from Table 10, the sex of the subject (variable A) was nota 
relevant factor in recall on Immediate Memory tasks (p< 0.337). The effect of 
varying stimulus density (variable B) was identified as a relevant factor in 
recall on Immediate Memory tasks (p<0.001). This result indicates that the 
subjects recalled significantly more of the six-letter than the twelve-letter 


TABLE 10 
SUMMARY OF THE THREE-WAY ANALYSIS OF VARIANCE 


Source SS DF MS F 2) 
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displays across all exposure levels used. The effect of varying exposure times 
(variable C) was identified as a relevant factor in recall on Immediate 
Memory tasks (p<0.001). This result is difficult to interpret specifically 
because of the apparent inconsistency among the subject groups. This 
inconsistency in effect is highlighted when one compares the F-ratios for 
stimulus density (107.95) which is a consistent factor in recall, and exposure 
levels (13.40) which appears to be an inconsistent factor in recall in terms of 
generalizing specific conclusions concerning exposure levels to all subjects. 
The interactions observed between sex and stimulus density (AB p<.956) 
and sex and exposure levels (AC p<.781) indicate that the significant effects 
on recall observed for stimulus density and exposure level were not related to 
the sex of the subject. This result is further supported by the lack of a 
significant interaction between sex, stimulus density, and exposure level 
(ABC p <.367). 

The significant interaction observed between stimulus density and 
exposure level (BC p<0.001) indicates a combined effect on recall. It should 
be noted that the interaction F-ratio (8.68) is lower than either of the main 
effects. Further analysis of the data may be due to the differential effects of 
exposure time on stimulus density over the subject groups. 


Discussion and Conclusion 


Since very few studies of this kind have been reported, it is very difficult to 
compare results with other studies. The discussion that follows clarifies the 
results. The major finding of increased recall as a function of age agrees with 
the results obtained by Blankenship (1938), Dornbush and Basow (1970), and 
McCarver (1972). 

Iconic Memory. The results of the one-way analysis of variance indicate a 
significant difference between groups in the mean recall of 6 and 12 letters. 
The differences between groups seems to be related to an increase in age and 
grade based on the results of the Newman-Keuls analysis. The significant 
difference between I-7 and I-6 subjects suggest a maturational effect on 
performance at this level. However, the significant difference observed 
between III-8 and II-8 subjects tends to suggest an educational effect on 
performance. The trend observed with the recall of 6 letters was for the I-6, 
I-7, II-7 and II-8 subjects to form a group of the lowest means for the recall of 
letters. All subjects in groups III-8 to VI-12 performed significantly better on 
mean recall for 12 letters than did I-6 to II-8 subjects. For the 6- and 12-letter 
mean recall, all groups performed significantly better in mean recall than the 
I-6 subjects. 

One of the most important findings was the trend for subjects in III-9 to 
VI-12 to consistently recall more of the 12-letter displays than of the 6-letter 
displays. This finding is emphasized by the trend for all age-grade groups 
III-8 and above to recall significantly more of the 12-letter displays than 
subjects in II-8 and below. It appears that this difference between the 
eight-year-old subjects in grades two and three marks the emergence of a 
functionally effective Iconic Memory. This interpretation is further 
supported by the consistent trend for the older subjects (III-9 to VI-12) to 
recall more of the 12-letter than of the 6-letter displays, as compared to the 
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younger subjects (I-6 to III-8). If, as the results reported by Teichner and 
Sadler (1962) and Haith (1971) suggest, the positive relationship between 
recall and stimulus density is a major distinguishing feature in the 
comparison of the recall in Iconic Memory between those subjects with a 
functionally developed Iconic Memory and those lacking a developed Iconic 
Memory, it seems reasonable to conclude that the subjects in groups III-9 to 
VI-12 do demonstrate a functionally developed Iconic Memory while those 
subjects in groups I-6 to III-8 do not. 


Immediate Memory. The development of this stage of the memory process 
was examined by the recall for high and low stimulus densities over three 
exposure times. 


The results of the one-way analysis of variance indicate the existence of 
significant differences in mean recall for both stimulus densities at the three-, 
six- and nine-second exposure levels. The trend noticed in Iconic Memory for 
the I-6 to II-8 groups to clump together with the lowest recall performance is 
consistent with their performance on all Immediate Memory tasks. This 
distinction into older (III-8 to VI-12) subjects and younger (I-6 to II-8) 
subjects is most significantly demonstrated for the recall of 6 letters at the 
nine-second exposure level. Although the I-7, H-7, and II-8 subjects 
demonstrated significantly greater recall than the I-6 subjects in Iconic 
Memory, no significant differences are found on any of the Immediate 
Memory tasks. This finding seems to indicate that, until a functionally 
effective Iconic Memory is developed, all groups are unable to effectively 
function on Immediate Memory tasks. 


The results of the three-way analysis of variance indicate that the 
exposure level and stimulus density are significant factors in recall, but the 
sex of the subject is not. As the exposure time increased, the significant 
differences between the older and younger subjects increased until, at the 
nine-second exposure level, the recall of all the older subjects (III-8 to VI-12) 
was significantly greater than the recall of all the younger subjects (I-6 to 
II-8). This finding agrees with Belmont and Butterfield (1971) and Dornbush 
and Basow (1970) that the younger subjects cannot store the information for 
effective recall. This conclusion is emphasized by the linear relationship 
between exposure time and the recall of the six-letter displays demonstrated 
by the III-9 to VI-12 groups. The I-6 to III-8 groups do not demonstrate this 
consistent trend for recall to increase as exposure time increases. The 
interpretation given this finding is that the older subjects have developed the 
ability to store the information and the increases in exposure time also 
permits the increased use of storage and coding strategies. 


The effect of exposure time on the recall for the twelve-letter displays was 
to provide further distinctions amongst the older subjects. As in the six-letter 
conditions, as exposure time increased, the number of significant differences 
between groups increased as well. As the exposure time increased, the 
differences between the older and younger subjects became more significant. 
This finding suggests that the storage strategies demonstrated by the III-9 to 
V-11 groups for the six-letter conditions are not generally effective in storing 
complex stimuli. The linearly related increases in recall and exposure time 
for the VI-11 and VI-12 groups seem to suggest that, along with their recall 
superiority over the majority of the groups, the grade six subjects have 
developed effective mnemonic strategies to deal with high density stimuli. 
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The lack of this trend in the other groups indicates that the effect of 
increasing exposure time appears to aid confusion rather than recall. The 
criteria presented earlier as indicating a functionally developed Immediate 
Memory were: (1) the linear increase in recall for the high density stimuli as 
exposure time increased, and (2) the higher recall of the high density as 
compared with the low density stimulus. These criteria lead to the 
interpretation of the results as indicating that the VI-11 and VI-12 subjects 
are in the final stages of the development of Immediate Memory. 


Summary 


In the present study, Iconic Memory was measured by the amount of 
information available for recall. The amount of information processed, or 
Iconic Memory capacity, was in the final stages of development in 
eight-year-olds in grade three, but functionally developed in nine-year-olds in 
grade three. Although the younger subjects (I-6 to II-8) demonstrated 
significant between-group differences on Iconic Memory tasks, these 
differences did not occur on the Immediate Memory tasks. The development 
of a functionally developed Iconic Memory appears essential to the 
development of an effective Immediate Memory process. 
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Measurement of Community School 
Development 


This study analyzed the development of community schools in the province of 
Alberta. The components of community schools were identified from the 
literature and a questionnaire was developed to measure the extent to which 
these components were found in the community schools of Alberta. The top 
ten community schools were then compared with ten randomly selected and 
matched comparison schools. 

A relatively large percentage of schools in Alberta (7.3%) were identified 
as community schools. Of the seven community school components, none 
were developed in the community schools to the extent suggested in the 
literature. However, the top ten community schools were significantly 
different from the comparison schools on each of the community school 
components. Each component was more highly developed in the community 
schools than in the comparison schools. (Dr. Sullivan is the Coordinator of 
Continuing Education at the Atlantic Institute of Education.) 


The purpose of this study was to identify the community schools 
operating in Alberta and to analyze the extent to which components of 
community school education have been developed in these schools. In 
addition, a comparison of community and non-community schools was made 
to determine whether or not the schools differed in the components 
considered to be characteristic of community school education. 


Tangible evidence of a desire to develop the components of the community 
education or community school concept in Canada can be seen in the reports 
such as the Edmonton Public Schools’ (1971) “The Community School: A 
Focus on Living”; The Nova Scotia Royal Commission on Education, Public 
Services and Provincial Municipal Relations, Volume III (Government of 
Nova Scotia, 1974); and The Province of Ontario’s What Happens Next Is Up 
To You (The Select Committee on the Utilization of Educational Facilities, 
1975). These three Canadian reports advocate the introduction and 
development of community schools. However, they draw on little or no 
evidence of the characteristics or success of community schools as 
documented by research. 
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This lack of research referencing is not entirely the fault of the report 
writers because little research has been completed in Canada or the United 
States in the area of community schools and/or community education 
(Decker, 1972). Seay and associates (1974) stated that, “With the exception of 
the 15 doctoral dissertations cited by Totten and doctoral dissertations along 
with some masters theses included by Olsen, most of the (American) works 
cited are descriptions of experiences of practitioners and observers in the 
field” (p. 393). No publication dealing with the community schools other than 
on an anecdotal, or participant frequency count level, appears to be available 
in Canada. 

Therefore, with the increasing interest in community schools throughout 
Canada and the lack of research on the subject, the area appeared to be 
appropriate for study. The need for studies in this area was emphasized by 
Musella (Coutts, 1975) in his address at the 1975 Canadian Society for the 
Study of Education annual conference. In this address entitled ‘Critical 
Issues Facing Educational Administrators: Directions for Future Research”, 
he listed community education as one of the nine critical issues facing 
educational administrators in 1975. Musella elaborated by providing the 
following two categories for research: first, “a need for a new model of 
community education which integrates that function into the total 
community services”; and second, ‘“‘the nature of community education”’.1 


This study could best be described as an exploratory study. A lack of 
research in the area of community education and community schools made it 
essential to develop an instrument to assist in the identification of existing 
community schools and to determine the amount of development in each of 
the community school education components. 


Method 
Conceptual Framework 

The conceptual framework of this study drew from the six basic 
components of community education as proposed by Minzey (1974), who lists 
the following six components as being essential to an effective community 
education program. 

Educational program for school age children. This component of 
community education is already available and operating in all public 
schools. It is included in the basic components, so that the grade 1-12 
program is seen in its proper perspective, i.e., as a vital part of community 
education. Too often community educators leave the regular school program 
out of their definition of community education. However, it is also important 
to remember that this “regular” school program is only a portion of the total 
package of community education. 

The major change to be made to the regular school curriculum, in relation 
to community education, is to make the curriculum more community based 
and relevant to every day living. This could be accomplished by including 
work experience projects in every child’s timetable; by having guest lecturers 
from the community speak, on a frequent and regular basis, to the children in 
their classrooms; and by designing textbooks and class assignments to 
reflect the local community. 
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Joint use of school and community facilities. This component of 
community education involved the use of the school buildings by the 
community members during the day, on weekends, at night, and during 
school holidays. In addition, this component includes the use of community 
facilities by the school staff and students, e.g., community swimming pool or 
rink. 


Additional programs for school age children and youth. Included in this 
aspect of community education are programs designed and executed for 
school age children before and after school hours, on weekends and during 
holidays. The activities could include enrichment, remedial, and 
supplemental educational programs, as well as recreational, cultural, and 
avocational programs. 


Programs for adults. The community education concept also includes 
programs for the adults in the community. The student body, in this instance, 
is perceived as all of the people who reside in the community served by the 
school, regardless of age. Programs similar to those described above for 
school age children and youth should be available for the adults. In addition, 
the adults should be free to participate in a regular academic day program. 


Delivery and coordination of community services. “The key role of schools 
is catalytic and the school would not provide programs or services which are 
already provided or capable of being provided by other agencies” (Minzey, 
1974, p. 2). 


Communities usually have an abundance of services, although a lack of 
coordination of these services usually gives the impression that more money 
and manpower are needed. One of the basic goals of community education is 
to coordinate existing services of the community before new and expanded 
programs are started. 


Community involvement. The final component in Minzey’s community 
education conceptual framework is effective community involvement. The 
community members must be involved in important decision-making in all 
aspects of the community education program. This is usually accomplished 
by the formation of community councils composed of representatives from as 
many sectors of the community as possible; for example, students, senior 
citizens, parents, business persons, teachers, the unemployed, etc. It is 
important that these people be involved in important decision-making, not in 
trivial decisions as given to the Parent-Teacher Associations of the past. 


Because much of the literature suggests that community school education 
cannot be pursued with great vigour or success without sufficient funds and 
an administrator (coordinator) spending a considerable amount of time 
promoting community school relations, a seventh component was 
introduced: “Administration: Coordinator and Finances.” This component 
dealt with the percentage of time staff members of the school devoted to the 
promotion of community-school relations and the amount of financial 
support provided in the school budget for facilitating a school-community 
program. 


Questionnaire 


Using the above seven components as the framework for community 
schools, a School Community Questionnaire (SCQ) was developed to measure 
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the extent to which a school had developed these community school 
components. Questions were designed to gather information on each of the 
seven components. For the purpose of illustration, the questions from the 
section dealing with additional programs for school age children are 
presented below and the method of scoring is explained. 


Section 3 — Additional programs for school age children. The questions in 
Section 3 of the SCQ were constructed to obtain data on the third dimension 
of community school education; i.e., additional programs for school age 
children. 

3.1 This question is concerned with programs for school age children (here 
termed “additional” programs) which take place before and after regular 
school hours, on weekends and during holidays. These programs could 
include enrichment, remedial or supplemental educational activities, as 
well as vocational or religious activities. Approximately how many of 
these additional programs per week are under the supervision of: 

(1) regular school staff and take place on or off the school property? 
programs/week.......... 

(2) “non-school” staff (not paid a salary by your School Board or 

Committee) and, yet, take place on school property? 
programs/week.......... 

Question 3.1 was constructed to gather data on the number of additional 
programs offered for school age children. 

Scoring: This question was scored by dividing the total number of 
programs offered by both regular school staff and “non-school” staff, by the 
number of professional staff in the school (Question 1.62). This gave an 
indication of the number of additional programs offered to the children in 
relation to the size of the professional staff. To facilitate further reference to 
this proportion, it was call X . 


Therefore, Question 3.1 (1) and 3.1 (2)= X, 
Question 1.6! 


The hypothetical model community school would score 1 point for Xz, 
demonstrating that the number of additional programs offered at the school 
equalled the number of professional staff members at the school. 

3.2 Approximately how many students participate per week in the programs 
identified in: 
(1) question 3.1 (1) above? NOTE: If a program has more than one session 
per week, account for the students each time they attend a session 
students/week.......... 
(2) question 3.1 (2) above? NOTE: If a program has more than one session 
per week, account for the students each time they attend a session 
students/week.......... 

Question 3.2 was designed to gather information on the number of 
students participating in additional programs. 

Scoring: The question was scored by dividing the total number of 
participating students by the number of daytime students attending the 
school (Question 1.5%). This proportion was called Xs 


207 


Keith C. Sullivan 


Therefore, Question 3.2 (1) and 3.2 (2) = Xx 
Question 1.5! 


The hypothetical model community school would have had Xs = 1, indicating 
that as many children participated in the additional programs as in the 
school’s regular daytime program. 


In a similar manner, each of the community school components was 
measured. Using the current literature’s description of “good” community 
schools, each component was designed to have a total score of four points.’ 
Therefore, a model community school would score four points on each of the 
seven community school education components and obtain a total score of 28 
points. 


Sample and Procedure 


Ninety-four Alberta schools were identified by school superintendents as 
following a community school mode of operation. For the purpose of this 
study, these schools were called community schools. The SCQ was sent by 
mail to the principals of the 94 community schools. Eighty-three of these 
SCQ’s (89.2%) were completed and returned. Comparison schools were 
randomly selected from the Alberta schools not identified by the 
superintendents as community schools and, eventually, these schools were 
matched with the top 10 community schools (as measured by the SCQ) on 
variables such as school size, school grade levels and type of school 
community. SCQ’s were given to the principals of the 10 comparison schools 
and all of these questionnaires were completed and returned. 


Treatment of the Data 


All data from the SCQ’s were coded and written on computer data 
punching forms. Then computer cards were punched and verified by a 
trained keypunch operator. 


The questionnaire variables were analyzed with the assistance of t-tests 
and F tests. Whenever the F ratio was significant, this test was followed by 
the Scheffé method of testing for significant differences between pairs of 
means. The a priori level of significance for all tests was set at .05 
(two-tailed). 


Results 
SCQ Scores from All Community Schools 


The mean data obtained from the 83 SCQ’s, for each of the community 
school education components, are presented in Table 1. The most highly 
developed community school education component in Alberta community 
schools was Delivery and coordination of community services, with a mean 
score of 3.4 points. According to the SCQ scores, the rank order of the 
remaining community school education components, as developed in Alberta 
community schools, was: (2) Additional programs for school age children and 
youth, 2.6 points; (3) Educational program for school age children, 2.3 points; 
(4) Joint use of school-community facilities, 1.7 points; (5) Administration, 1.6 
points; (6) Community involvement, 1.2 points; and (7) Programs for adults, 
1.0 points. 
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TABLE 1 


MEAN SCORES ON COMMUNITY SCHOOL EDUCATION COMPONENTS FOR 
THE EIGHTY-THREE ALBERTA COMMUNITY SCHOOLS 


Community School Mean Score Score for 
Education Components Hypothetical Model 
Community School 


1) Educational program for 
school age children 75,3) 4.0 


2) Joint use of school and 
community facilities WF 4.0 


3) Additional programs for 


school age children and youth PaaS) 4.0 
4) Programs for adults iL6@ 4.0 
5) Delivery and coordination 

of community services Bad 4.0 
6) Community involvement ihe 4.0 
7) Administration 1.6 4.0 

TOTAL 1358 28.0 


The hypothetical model community school scored four points on each of 
the community school education dimensions and, therefore, accumulated a 
total score of 28.0 points. The Alberta community schools’ mean total score 
was 13.8 points. 


SCQ Scores from Community Schools and Comparison Schools 

The SCQ data obtained from the top 10 community schools and 10 
matched comparison (non-community) schools are presented in Table 2. The 
community schools’ mean scores for each of the seven community school 
education components were all significantly higher (p<.05) than the 
corresponding comparison school mean scores. This would appear to indicate 
that the community schools, as measured by the SCQ, developed the 
community school education components more than did the comparison 
schools. The greatest differences were found in Additional programs for 
school age children and youth, Programs for adults, Delivery and 
coordination of community services, and Administration. 

The rank ordering of the community school education components differ 
between the two types of schools. The most highly developed component of 
the community schools was Delivery and coordination of community 
services, followed by Additional programs for school age children and youth. 
Both of these scores were higher than the scores assigned to the hypothetical 
model community school. The least developed component of the community 
schools was Community involvement, having less than half the score of the 
hypothetical model community school. 

None of the comparison school mean scores were as well developed as the 
hypothetical community school scores. The most highly developed 
component was Educational program for school age children, followed by 
Delivery and coordination of community services. The least developed 
component of the comparison schools was Administration. 
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TABLE 2 
MEAN SCORES ON COMMUNITY SCHOOL EDUCATION COMPONENTS FOR 
THE TOP TEN COMMUNITY SCHOOLS AND TEN MATCHED 
(NON-COMMUNITY) COMPARISON SCHOOLS 


Community School Community Comparison P Significance 
Education Schools-- Schools == (twosteated) 
Components Mean Score Mean Score 


1) Educational 
program for 
school age 
children PEEK PEWS) 0.01 Sion 
2) Joint use of 
school and 
community 
facilities Sei ioe 0.02 Sig. 


Ww 


3 


wm 


Additional 
programs for 
school age 
children and 


youth Live, 78} Way 0.00 Sig. 
4) Programs for 

adults Aa 0.54 0.00 Sich 
5) Delivery and 

coordination 

of community 

services Ho Bie Wo 0.00 Sig. 
6) Community 

involvement 90 0.83 0.03 Sigs 
7) Administration 2.80 0.50 0.00 Supe 


Discussion 


Considering that, in Canada, community schools came into focus only in 
1971, a relatively large percentage of schools in Alberta (7.3%) were identified 
by their superintendents as community schools. Of the seven components of 
community school education utilized in this study, none was developed in the 
community schools of Alberta to the level of the model community school. In 
fact, four of the components, Joint use of school and community facilities, 
Programs for adults, Community involvement, and Administration were 
developed to less than one-half of the level of the model community school. 


This relatively large percentage of schools identified by superintendents 
as community schools and the low scores obtained by these schools on the 
components of community school education imply that superintendents’ 
concepts of community schools are different from those stated in community 
school literature. The desire of superintendents to identify schools as 
community schools may indicate that much more emphasis should be placed 
on understanding the community school education concept by people at all 
levels, i.e., provincial Department of Education personnel, local school 
administrators, and university faculty. 
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Alberta community schools generally should place more emphasis on 
each of the seven community education components. However, special 
emphasis should be placed on promoting joint use of school and community 
facilities, developing adult programs, and encouraging community 
involvement in education decision-making. Also more community school 
coordinators should be hired and/or administrators encouraged to spend 
more time promoting school-community relations. 


On a positive note for community school advocates, the top 10 community 
schools were significantly different from the comparison schools on every 
community school education component. Each component was more highly 
developed in the community schools than in the comparison schools. 
Therefore, although the community school education concept is not well 
understood by educators, it is practically attainable by public schools. It 
appears that a school staff can develop a significantly different type of 
school by following the community school education concept. 


The SCQ’s conceptual framework, drawn from the work of Minzey, 
seemed to be appropriate for the study of community schools. Practitioners 
interested in community school education should be able to use the SCQ to 
assist in determining the extent to which their schools have developed the 
components of community school education. Consequently, from the SCQ 
scores, it would be relatively simple to ascertain which community school 
education components need more attention and development. 


Recommendations for Further Study 


This study was restricted to the province of Alberta. Other studies might 
examine community schools in other provinces on the variables examined in 
this study. There seems to be a need for further work on the SCQ to measure 
the development of community school education components, particularly to 
determine if each of the seven components should be equally weighted and to 
detect any bias in the instrument for schools of various sizes. 


This study is drawn from a larger study completed by the author and entitled “Community Schools: An 
Analysis of Organizational and Environmental Characteristics”, Unpublished doctoral dissertation, Sullivan, 
1976. 


Notes 
1. Taken from a hand-out distributed at the 1975 Canadian Society for the Study of Education Conference, 
Edmonton, Alberta. 
2. Number of professional (certificated) staff in school. 
3. Number of daytime students in school. 


4. Any score above four points was reduced to four points so that a school could not score extremely high on the 
total score by having developed only one component extremely well (e.g., adult classes) and left the others 
poorly developed. 
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BOOK REVIEWS 


RESEARCH IN EDUCATION (THIRD EDITION). By John W. Best. 
Englewood Cliffs, New Jersey: Prentice-Hall, Inc., 1977, $14.95 (U.S.), 403 pp. 


Institutions that offer graduate degrees in education must introduce 
students to the language and techniques of research. The task is not an easy 
one, largely because of the diverse backgrounds of the students, whose fields 
of expertise might range from history to algebra, linguistics to vocational 
education, or Shakespeare to statistics. Most institutions require incoming 
candidates to enrol in a course in research methods, for which a cluster of 
texts is available. One of the texts intended for this purpose is the third 
edition of Best’s Research in Education. 


The book consists of ten chapters covering topics that are common to most 
texts in the field. The usual sequence is a beginning chapter on the general 
nature of research, followed by others that more or less parallel researchers’ 
efforts in developing problems and carrying them through to the final 
reports. As might be expected, the bulk of the content can be found in other 
texts such as one written by Travers (1969) or by Englehart (1972). 

Although the content includes the expected topics, Chapter 3, on using 
reference materials, goes far beyond what is provided in other texts. After a 
routine endorsement and description of library facilities, such as the card 
catalogue and microfiche, the author includes information of particular value 
to apprentice researchers in education. Under such headings as “References 
about references” (p. 43), “The index” (p. 46), “Abstracts” (p. 57), 
“Encyclopedias” (p. 61), and “Instructional aids” (p. 77), literally hundreds of 
sources are listed. The inventory is extensive (the Canadian Education Index 
is included) but it would be impossible to name all appropriate references (the 
Alberta Journal of Educational Research is omitted). The list in its present 
form is of unquestionable value, but a little more organization would have 
been helpful. It would have assisted the reader to place the titles in 
alphabetical order instead of what appears to be random order. Also, the 
heading “Biographical references” appears twice (p. 72 and p. 74) for no 
apparent reason. 

An outstanding feature of Best’s book is its readability. The reader gets 
the impression that there is a crisp clarity that stems from the author’s 
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genuine enthusiasm for his subject. There are parts of the book that read 
more like a novel than a text on a topic that is often described by students as 
deadly dull. 


The content is developed at a basic level with extensive references to 
concrete applications. For some readers, the approach would be considered 
too simplistic, in that several essential concepts are brushed over very 
lightly. For example, test validity is treated in little more than one page, 
while test reliability receives only nine lines of text. The standard error of 
measurement is not mentioned. 


In a text intended for the purpose of introducing research procedures to 
prospective graduate workers, it is difficult to understand why some research 
studies are dealt with in such detail. For instance, nine pages are devoted to 
Terman’s Genetic Studies of Genius in Chapter 3 dealing with descriptive 
statistics, and almost four pages are devoted to the Eight Year Study. Itis not 
clear if these projects are intended to illustrate certain procedures or if they 
are presented as interesting studies in themselves. 


The author’s efforts to achieve understandability through simplicity of 
presentation causes a few problems. Chapter 6, the Tools of Research, 
represents a grouping of topics that does not seem logical. Questionnaires, 
opinionnaires, and interviews certainly belong in this section, but placing 
“qualities of a good test” (p. 188) between ‘projective devices” and 
“sociometry” does not seem reasonable. In the same chapter, Best’s talent for 
clarity seems to have momentarily deserted him in the description of 
edge-punched cards. 


Another problem that stems from the desire for simplicity is the 
occasional misleading passage. The discussion of percentile ranks makes no 
distinction between a percentile and a percentile rank and actually uses the 
term ‘‘percentile rank”? when the term “percentile” is appropriate (p. 241). 
Also, the impression is given that T scores and stanines are computed 
without first normalizing the distribution. 


Research in Education is, as stated on the dust jacket, “an invaluable tool 
for the beginning graduate worker in the field of education.” This applies 
especially if the students initially have uncomfortable feelings about 
statistical procedures, which is often the case where the research project 
required for a master’s degree is less formal than a thesis. Some 
shortcomings are evident, but because none of them are too severe, the book 
should seriously be considered as a text or major reference for an 
introductory course in research methods. 


V.R. Nyberg 
Department of Educational Psychology 
The University of Alberta 
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CANADIAN EDUCATION: A SOCIOLOGICAL ANALYSIS. By Wilfred 
B. W. Martin and Allan J. Macdonell. Scarborough: Prentice-Hall of Canada, 
1978, 354 pages. 


Martin and Macdonell have provided educators with a long-awaited 
sociology of education textbook which has a truly Canadian emphasis. As 
the authors say in the preface of their book, they have focussed on the 
microsociological approach, on the macrosociological approach, and on 
social change, in their analysis of education in Canada. Functionalism, 
conflict, and symbolic interactionism perspectives are used to direct their 
presentation. 


Part I of the four-part book provides the reader with information about 
Canadian society its population, languages, religions, government, 
economy, and educational structure. A brief account of the Canadian 
historical background assists the reader to understand the present economic, 
political, ideological, and social environments in Canada in relation to the 
educational scene. 


Part II focuses on the social organization of the school generally a 
microsociological approach to the study of education. Groups and grouping 
in the school are discussed using sound sociological theory. Here is a book 
where the teacher as a part of the school social environment is not forgotten. 
The formal organization, the finance, and the legal dimensions of education 
in each of the Canadian provinces and in the territories (to a limited extent) 
are analysed, as are interest groups at the local, provincial, and federal 
levels. By including an overview of history of education in Canada, the 
authors provide a base for what exists today. Part II concludes with a chapter 
devoted to the analysis of changes in population, schools, and school 
systems. 


In Part III, the authors move to a macrosociological approach in their 
analysis of education in Canada. They explore educational opportunities by 
surveying the financial resources, teacher qualifications, student 
populations, and denominational differences of school systems. What appear 
to be the impacts of urban and rural environments and of culture and social 
class on educational opportunities are summarized. Social mobility and 
education, and life chances of Canadians are assessed. Finally, the authors 
deal with education in Canada and the labour market. What has been 
presented is interpreted from structural functionalism and _ conflict 
perspectives. 

The three views of society — functionalism, conflict, and symbolic 
interactionism — are used in Part IV as approaches to the study of social 
change and education in Canada. Demographic, economic, and national 
identity changes and their probable consequences for education (changes in 
schools, in finance, in interest groups, and in educational opportunities) are 
explored. In addition, the authors look briefly at education and the future. 

There are three very valuable appendices in the book. The first 
summarizes the development of sociology of education; the second explains 
the three sociological approaches used in the book. The third appendix is 
quoted from Section 93 of the British North America Act. 

The book seems truly Canadian in its source material. In the 38 pages of 
references, there are more than 800 sources listed, of which approximately 
600 have Canadian authors, samples, and/or publishers. Statistics Canada, 
beginning with 1972 publications, is used extensively. 


The format of the book — parts, chapters, and appendices in 302 pages — 
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supplies an organized presentation to the reader. Chapters have a brief 
introduction stating what is to come, followed by headed and subheaded 
sections and a concise summary. Charts, tables, and notes at the end of each 
chapter provide valuable information. Both name and subject indexes 
increase the usefulness of the book. 


Generally, the authors appear to have produced the kind of book they say 
they have written. Students of education and of sociology and professors 
should find it a valuable textbook. While the authors recommend that 
students have a prior introductory course in sociology before using their 
book, and while the reviewer advises that all sociology of education students 
have an introductory sociology course as a prerequisite, the style of writing of 
Martin and Macdonell seems to make an introductory course unnecessary. 
Their writing appears to focus at the undergraduate student. 


Except that the paperback edition tends to come apart, the reviewer finds 
very little to criticize about Martin and Macdonell’s book. The symbolic 
interactionism perspective does seem to have received less attention than 
structural-functionalism and conflict perspectives. While Canadian history 
is featured in some chapters, it appears appropriate to include it in this 
sociological analysis of Canadian education. 


Martin and Macdonell have contributed a very valuable book in the field 
of sociology of education. It is current, it has a historical base, it is Canadian, 
and it is sociological in perspective. 


Helen L. Pierce 
Department of Educational Foundations 
The University of Alberta 


WHY BE A HEADMASTER? By J. R. Sutcliffe. Melbourne: Melbourne 
University Press, 1977, $11.80 (Australian), 170 pp. 


During the last decade there has been a steadily growing interest in the 
subject of headmastership. Edmonds began it with The First Headship (Basil 
Blackwell, 1968); this was followed by Headship in the 1970’s (Bryan Allen 
[Ed.], published by Basil Blackwell, 1968), and Running a School (C. H. Barry 
and F. Tye, published by Temple Smith, 1972). In both the United States and 
Canada, study of the role of the principal has become a major industry, most 
university graduate courses in educational administration devoting a good 
deal of lecture-time and publication-space to the concept and practice of 
principalship. It might seem then, at first glance, that Sutcliffe is simply 
following a well-trodden path. 

Let it be said immediately that this is not the case. Why Be a Headmaster? 
is not a study of the theory of principalship, nor does it concern itself with 
such run of the mill questions as “the principal as curriculum coordinator,” 
“the principal as manager,” “the principal as allocator of human and 
material resources,” etc. Instead, Mr. Sutcliffe presents us with an intensely 
personal account of his life, including all those many and varied experiences 
that went into the making of an outstanding headmaster. 

But the book is no panegyric. One might add to Mr. Sutcliffe’s many 
delightful and apposite quotations one from the great bard himself: 


Speak of me as I am; nothing extenuate, 
Nor set down aught in malice. ( Othello V. II. 341-2) 
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Out of the triumphs and disappontments which Mr. Sutcliffe faithfully 
records, one may distill many of the lessons which, in university courses on 
educational administration, are more talked about as open-ended 
case-studies, rather than shewn in operation with practical outcomes. These 
include the vital importance of human communication at all levels; the 
intellectual and, no less important, emotional muscularity that any principal 
has to have; the need for a principal to exercise alternately authoritarian and 
democratic roles, at different times and in different situations. 


It may be that Mr. Sutcliffe represents a type of headmaster finding no 
place in this increasingly egalitarian school world of ours. When I read that, 
in Britain, a teacher who defied his headmaster was recently awarded 
considerable damages by an industrial tribunal, I wonder how deeply 
undermined the traditional authority of the Head has become? Are the 
escalating cases of violence (euphemistically dubbed disruptive behaviour) in 
State schools derivatives of it? Mr. Sutcliffe’s discipline was firm, but kindly 

the kind which Lord Esher’s famous judgment of 1894 prescribed. Not that 
this is the only way. I once had the privilege of visiting Summerhill (a private 
school) and formed a great regard for Neill’s approach. But circumstances 
alter cases; and one of the skills of headmastership lies in recognizing the 
difference. 


The closing chapters of Why Be a Headmaster? deal with Mr. Sutcliffe’s 
departure from schoolmastering to the stock exchange. One little quoted 
incident reveals the man Mr. Sutcliffe is. A client of his bought some shares 
by telephone and then repudiated them when the market fell. Mr. Sutcliffe 
took the shares into his own name and endured the financial loss. 

The book is a fascinating autobiography, written in a pleasant, modest, 
easy style. It is beautifully printed, with one or two excellent illustrations. I 
would like to see a copy of this book in every school library across the world, 
for it offers a philosophy of life so many adolescents can profit by when 
starting out on life: 

Desire not to live long but to live well; 
How long we live not years, but actions, tell. 
(Rowland Watkyns, Flamma Sine Fumo, The Hourglass) 
EK. L. Edmonds 
Professor of Education 
University of Prince Edward Island 


BELEVING EN ERVARING (EXPERIENCE AND LIVED EXPERIENCE): 
WERKBOEK FENOMENOLOGIE VOOR DE SOCIALE WETEN- 
SCHAPPEN. By Ton Beekman and Karel Mulderij. Amsterdam: Boom 
Meppel, 1978, 147 pp. 


In 1960, Stephan Strasser wrote about how, during the defense of a 
historical dissertation, a doctoral candidate was reproached for having 
neglected an important source. The student’s defense consisted of a simple 
dismissal of the objection with the astonishing remark that he was following 
the “phenomenological method.” A phenomenologist, so he said, has to 
analyze only a single representative case and does not have to waste time in 
carefully tracing and consulting sources. 


Occurrences such as this are possible only because of ignorance on the 
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part of the student as well as his or her dissertation advisors. 
Phenomenology as a form of inquiry sometimes is understood so little that it 
is made to pass for almost any kind of research that has a subjective 
orientation. Clearly, it is highly desirable that the field of social inquiry be 
protected from such dilletantism. 


How do you provide people with an introductory but adequate 
understanding of phenomenology? For years, Husserl had been under the 
illusion that it would be a comparatively simple matter to write a popular 
introduction to phenomenology, recounts Spiegelberg (1970), but in reality all 
his attempts throughout the last ten years (attempts which resulted in the 
London and Paris lectures and the French Meéditations) has been without 
satisfactory results. Ever since Husserl, phenomenologists have made efforts 
to make their ideas more readily available to the interested readers but 
without much avail. Possibly, Beleving en Ervaring (‘Lived Experience and 
Experience’) is the first successful text which manages to present to Dutch 
educators the most prominent features of the phenomenological method and 
attitude. (An English version is currently under preparation.) In contrast 
with many other introductions to phenomenology, this book by Ton 
Beekman and Karel Mulderij requires (on the part of the reader) little or no 
familiarity with traditional phenomenological sources. 


Beleving en Ervaring is a modest book of 147 pages, divided into four 
main parts. The first chapter introduces the idea of phenomenology by 
squarely relating it to the commonplaces of our everyday life. In the second 
chapter, “An Initial Acquaintance with Phenomenology and How We 
Worked with Students,” the authors present the systematic procedures and 
workshop activities that were used to introduce students, all educators, to 
phenomenology. The second half of the book, chapters three and four, are 
more discursive and explanatory than the workshop sections. They introduce 
the reader to a phenomenological view of “Knowledge and Reality” and toa 
discussion of aspects of “The Phenomenological Method.” What makes this 
discussion of special interest is that they are written in a phenomenological 
style, that is, the authors use phenomenological reflection as a way to 
introduce the readers to phenomenology. 


Beekman and Mulderij succeed in the felicitous endeavour to clarify major 
phenomenological concepts such as “life world” (Lebenswelt) and “‘eidetic 
reduction” without the use of this technical language. In their introductory 
chapter the authors note that there are two approaches to phenomenology: 
“you can talk about it and you can doit’. They set out to involve the reader in 
doing it. The reader is invited to partake in a number of exercises which are 
aimed at developing some sensitivity and artful competence in analyzing the 
essential structures of selected phenomena of everyday life “waking up,” 
“fear of the dark,” etc. But phenomenological method is not a skill, a 
technique, or a set of procedures which can be mastered in a straightforward 
manner. Therefore, Beekman and Mulderij intersperse their reflective 
display of workshop activities with a series of advisory comments. 


The first advice is: “Describe a personal, simple experience as much as 
possible in life-world language; watch out for interpretations and causalities 
in the description, and do not get lost in factual detail. The important point is 
the ‘lived’ aspect of it” (p. 28). The participants of the workgroup are asked to 
write out a personal experience on a topic such as “fear of the dark.” 
Beekman and Mulderij point out that a careful descriptive account of a 
personal experience is a preparatory component of phenomenological 
method — comparable to data collection in traditional empirical research. 
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The next task is to analyze this personal life-experience description for its 
structural elements (reduction of essences). 


Exercises such as these are further experimented upon by varying the 
examples or by thinking through analogous experiences. This is an attempt 
to look at the same phenomenon using shifting perspectives which may be 
only slightly different from one another. Next, the field of intersubjectivity is 
expanded by complementing personal life-world material with new material 
obtained from such sources as existentialist or phenomenological literature, 
poetry, novels, diaries, interviews, folk wisdom, observations, and art. The 
attempt is to come to an understanding of a given phenomenon through 
exhaustive description (rather than through causal explanation). 


By means of example and illustration, Beekman and Mulderij show that 
the central focus of phenomenological inquiry is on the structure of the 
life-world: The method is situation analysis. Naturally, Beekman and 
Mulderij do not work in a vacuum. They are the more recent proponents of a 
group of scholars which in the nineteen fifties became known as the Utrecht 
School. Important names associated with the University of Utrecht were 
Langeveld in pedagogy, Buytendijk in psychology, Riumke in psychiatry, and 
a whole array of like-minded scholars. The work of these individuals, in turn, 
was based on the thoughts of Husserl, Dilthey, Schleiermacher, Litt, Flitner, 
and others. The members of the Utrecht School wanted to experiment with 
new forms of psychology and pedagogy. Beekman and Mulderij tell how 
talented social scientists such as Langeveld and Buytendijk wrote jewels of 
descriptive analyses and heavy treatises about their methodology. You can 
admire them, Beekman says, and sometimes follow their thoughts, but 
seldom can you follow in their footsteps. 


So it is in Beleving en Ervaring that Beekman and Mulderij show how 
situation analysis, in a phenomenological sense, is done. They identify three 
distinct but interrelated facets of the investigative studies of the proponents 
of the Utrecht School: (1) there is the gathering of life-experience material; (2) 
this material is investigated for its descriptive analytic forms, or ground 
structures; (3) recommendations and orientations to practical action are 
formulated. In the actual work of individuals such as Langeveld and 
Buytendijk these elements are artfully interwoven in masterly descriptions. 
For newcomers to the enterprise of phenomenology, such descriptions are 
virtually impossible to reconstruct. Beekman and Mulderi do not claim to 
make full-fledged phenomenologists out of the participants of the workgroup. 
But there is little doubt that they succeed in providing the students and 
readers of their book with a worthwhile introduction to the confounding field 
of phenomenological inquiry. Beleving en Ervaring leads the reader into 
acknowledging that children experience their life-world differently than their 
teachers and parents. Note how Beekman and Mulderij reinforce this point in 
their eighth advice: 

As a pedagogue, gather and collect life-experience material of children. Give 
children the opportunity to be as “direct” as possible. In the written and oral 
expressing of experiences create a relaxed and communicative atmosphere. 
Record verbal expressions on tape and later transcribe them integrally. If you are 
less experienced in phenomenological analysis, use more life-experience material. 
(p. 55) 

Merleau-Ponty once remarked that in much psychological literature we 
seem to reduce the child’s experience to our own. Educators are often 
insensitive to the observation that children experience time, space, etc., ina 
different modality than adults. Beekman notes how the development of 
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investigative instruments and statistical techniques dominates educational 
research. Increasing amounts of time and funding is spent in sophisticated 
techniques. but as the processing of information and data becomes ever more 
advanced, it seems that the most human element is being ignored: man’s 
subjectivity. Phenomenology may become a countervailing force against the 
increasingly externalized experiences, drained of all subjectivity, which 
dominate the pages of educational research journals, argues Beekman. In the 
second part of their book, Beekman and Mulderij reflectively expand on the 
nature and possibilities of phenomenology. A phenomenological approach, 
according to the authors, can contribute to a more humanized theory of 
schooling (p. 94). It responds to the need in research for routes of access to an 
understanding of classrooms, schools and learning experiences as they 
appear to the child and the teacher. In these speculations Beekman and 
Mulderij seem to be running ahead of what they are able to offer us in their 
introduction to phenomenology. Meanwhile we look forward to an English 
version of their text. 


Max van Manen 
Dept. of Secondary Education 
The University of Alberta 
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ALAN D. BOWD 


University of Victoria 


Professional Characteristics of Canadian 
Educational Psychologists* 


One hundred and eighty-seven individuals teaching in educational 
psychology departments within Canadian universities were surveyed 
regarding their training, professional activities, and identity. Within group 
differences according to principal teaching areas are described and some 
implications discussed. (Dr. Bowd is an Associate Professor in the Faculty 
of Education, University of Victoria.) 


Educational psychologists in Canadian universities make important 
contributions to teacher education, principally in the areas of learning, 
child development and educational measurement. They are responsible in 
large part for the professional education of school counsellors, school 
psychologists, and special education teachers. Because of the variety of 
tasks and responsibilities delegated to them within faculties of education, 
distinctive areas of specialization have developed; consequently, differences 
in role behaviour among educational psychologists have emerged. Such 
differences are, of course, also a reflection of differences in training and 
background. The distinctive functions of educational psychologists have led 
to the establishment of separate departments of educational psychology in 
many education faculties. In some cases, educational psychology has been 
further divided into departments concerned with the education of 
exceptional children, counsellor education, and educational measurement. 

The objective of the present study is to describe the professional 
characteristics of educational psychologists as a whole and, at the same 
time, to examine any differences which may characterize specializations 
within educational psychology. Educational psychologists have maintained 
an identity distinct from other educators, presumably in part because of 
their training, professional affiliations and apparent identification as 
“nsychologists.”” Schmidt (1976) has remarked that “they do not know 
whether to identify primarily with educators or with psychologists,’ an 


* Based in part upon papers presented to the Annual Meeting, Canadian Psychological Association, Vancouver, 
1977, and the Annual Meeting, Canadian Society for the Study of Education, Fredericton, 1977. 
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observation reinforced by McKeachie’s view that educational psychologists 
“feel like a minority group” vis a vis their colleagues in psychology 
departments (Warren, 1976). Information concerning the professional 
characteristics and activities of educational psychologists is necessary if 
the issue of their “identity” is to be clarified; however, it also bears upon 
such important concerns as training programs (Keats, 1976; Schmidt, 1976) 
and the most satisfactory professional organization to meet the needs of the 
group (Warren, 1976). Most important, questions relating to the future 
prospects of educational psychology, particularly its role in teacher 
education, depend upon a clear present understanding of the role behaviour 
of educational psychologists (Biggs, 1976; Englander, 1976; Nixon, 1976). 


Design 

Subjects 

All full-time members of educational psychology and allied departments 
(e.g., counsellor education, special education, applied psychology) were 
listed from current university calendars and surveyed (N = 285). The 
population did not include members of non-departmentalized education 
faculties or regular psychology departments. 187 questionnaires were 
returned, representing a response rate of 66%. 


Respondents were placed in one of four categories according to the 
principal current teaching area they indicated. The categories were 
Learning and Development (LD), Counselling (CO), Special Education (SE) 
and Measurement (ME). Eleven respondents who stated other than the 
preceding teaching areas were deleted, and the data for each category 
analysed separately. 


Method 


Questionnaires were mailed to all subjects with a covering letter and 
return addressed envelope. The questionnaire was anonymous; however, 
provision was made for subjects to indicate their name and address if a 
preliminary report of results was desired. The form contained 14 items 
dealing with the following issues: 

1. Biographical data (age, sex, degree status); 

2. Professional experience (field experience, university teaching 
experience, current teaching and research interests, time allotted to 
professional activities); 

3. Publications in refereed journals over the previous five years; 

4. Professional “identity” (membership in professional organizations 
and expressed allegiance to them, journals read, identification as a 
“psychologist” or “educator’’). 


Results and Discussion 
The proportion of women teaching in educational psychology 
departments is 18.7%, a relatively high figure in comparison with 10.3% 
reported for the social sciences generally (Woodcock & Sullivan, 1975). 
Within educational psychology departments, the proportion of women is 
highest in special education (29%) and lowest in the measurement area (4%), 
a disparity reflecting traditional limiting sex-role expectations. The sample 
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iy 
HIGHEST DEGREES HELD rp OF EACH GROUP? 
LD co SE ME Total 
Ph.D. 80 67 71 78 76 
Ed.D. 9 ot 13 17 13 
M.Ed 5 9 13 4 a 
M.A 4 2 3 - 3 
Other 3 = = es 1 


* In this and subsequent tables, percentages have been rounded off to the 
nearest whole number in each category. Principal teaching area divisions 
are abbreviated thus: LD (learning and development), CO (counselling), 
SE (special education), and ME (measurement). 


TABLE 2 


YEARS OF EXPERIENCE IN UNIVERSITY TEACHING AND IN 
OTHER EDUCATIONAL SETTINGS 


LD CO SE ME Total 
University 
Mean A Dhbes Fae 8.63 7.68 LO) 52 O14 
Standard Deviation 8.04 By 40) She 1S) BiotfXe) 6.36 
Other 
Mean iq oii GOD) 9.68 : 6.48 6.39 
Standard Deviation 4.66 6.40 The Sill 6.88 6.00 


ranged in age from 27 to 71, with a mean of 43 and little variation across 
teaching areas. 


As indicated in Table 1, 89% of the sample possess the doctorate with 
only slight variations across teaching areas. The Ed.D. is relatively 
uncommon among educational psychologists with the highest proportion in 
counselling. 

When the figures for past professional experience (Table 2) are 
considered in relation to the average age of the group, it is evident that the 
“typical” individual began university teaching in his or her early thirties 
after about six years of experience in the school system. The tendency in 
more applied areas, particularly special education, is for academics to have 
considerable experience in other educational settings before beginning to 
teach at the university. Eighty-three per cent of the sample as a whole 
reported work experience in educational settings other than the university. 

When the specific nature of past field experience is examined, clear 
differences emerge according to areas of specialization. The majority of 
individuals in all areas have teaching experience in the schools. The very 
great majority of counsellor educators report experience as_ school 
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TABLE 3 
PERCENTAGES REPORTING SPECIFIC PAST FIELD EXPERIENCE 


LD co SE ME Total 
Teacher UP 65 77 78 72 
Counsellor 28 84 32 26 40 
Special Education 6 5 45 _ ll 
Teacher 
Psychologist 42 49 55 DD; 42 
Administrator Bz 26 55 43 BZ 
Other 18 14 ils) ins} 17, 


counsellors. However, the proportion of those in special education who do 
not have related prior experience as a special education teacher is greater 
than 50%. Outside of the special education area, the number of individuals 
having worked as special education teachers is virtually zero. While the 
demand for specialized teachers of exceptional children is increasing, it is 
evident that there is a parallel need for experienced and qualified 
individuals to train them within education faculties. 

Respondents showed a high level of agreement between stated teaching 
area and principal research interests. For example, only 7% of all 
respondents in teaching areas other than special education indicated that 
their principal research interest was in special education. 


TABLE 4 
PUBLICATIONS IN REFEREED JOURNALS OVER PREVIOUS FIVE YEARS 


LD co SE ME Total 
Nil 20m 20.93 PPD XS) Lo 3S WS) 7S 
One 134972 1369S) IS), 35) 4.35 127783 
Two 12.66 ), 310) 9.68 13.04 123 
35, 22.78 Sih 2al. 82526 Sie) sills 29.44 
6-10 AD 2S LI 3} Se 28) ye 16.58 
Wik=7A0) 5.06 4.65 L250 4.35 6.42 
7a 5.06 2533) == ==> 3.74 


Subjects were questioned concerning their record of publications in 
refereed journals over the previous five years. Publication counts of 
psychologists in Canada (e.g., Nelson & Poley, 1971) have indicated a mean 
publication rate of about one per person per annum. The median among the 
educational psychologists forming the present sample is 4 over 5 years; 
however, approximately one-third of respondents do not publish or are 
one-study publishers. This figure is lower than has been cited for educators 
generally (Arlin, 1977), though the discrepancy may be attributable in part 
to varying methods of data collection. 


The highest publication rates were reported for the measurement and 
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learning/development areas, where respectively 26% and 30% of 
respondents claimed six publications or better over the previous five years. 
Publication rates tended to be slightly lower in the more applied areas of 
counselling and special education, where respectively 35% and 42% of 
respondents reported no publications or only one over the previous five-year 
period. 


TABLE 5 


PERCENTAGES REPORTING CURRENT MEMBERSHIP IN 
PROFESSIONAL ORGANIZATIONS 


LD co SE ME Total 

Canadian Psychological 46 4l 2% 13 41 
Association 
Canadian Society for the 
Study of Education a me ae oe me 
Canadian Guidance and 4 7h 6 4 20 
Counselling Association 
Council for Exceptional 

21 
Children ? ue ie : 
American Educational 39 j 29 65 35 


Research Association 


Membership in professional organizations is shared almost equally 
between two “umbrella” organizations (C.P.A. and C.S.S.E.) with C.G.C.A. 
and C.E.C. accounting for more specific applied interests in counselling and 
special education. A large proportion of educational psychologists from 
fields other than counselling belong to A.E.R.A.; however, when asked to 
indicate the association to which they felt primary allegiance, only 9% 
elected A.E.R.A.. This compares with 18% choosing C.P.A., 14% C.S.S.E., 
14% C.G.C.A., and 7% C.E.C. Clearly there is considerable overlap in 
membership between organizations, though it appears that both C.P.A. and 
C.S.S.E. are serving educational psychologists as “parent” organizations 
for the disciplines of psychology and education. This conclusion tends to be 
supported by the fact that C.P.A. is the preferred organization of 
respondents who identify as “psychologists” while C.S.S.E. is preferred by 
those who identify first as “educators.” 

As shown in Table 6, educational psychologists report spending the 
largest proportion of their time in teaching with about equal added 
emphasis upon field-related and research activities. Only in the counselling 
area does field-related activity clearly demand a larger proportion of time 
than research. 

Whether educational psychologists identify as educators or as 
psychologists depends upon their teaching area, professional experience, 
and training. As indicated in Table 7, large majorities of individuals in 
special education and measurement identify as “educators” with those who 
identify as “psychologists” predominating in counselling and _learn- 
ing/development. One reason for this may be that individuals specializing 
in measurement and special education tend to be concerned with specific 
educational questions and problems upon which psychological theory and 
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TABLE 6 
MEAN PERCENTAGES OF TIME SPENT ON PROFESSIONAL ACTIVITIES 


LD co SE ME Total 
Teaching Sill 48 45 45 48 
Applied: field and 19 31 23 22 23 
community related 
Research 23 is) 24 24 22 


a eas oes per é Feet ih 
Several respondents indicated additionally that administrative activities 
were significant. 


TABLE 7 


PERCENTAGES IDENTIFYING AS “PSYCHOLOGIST” 
OR AS “EDUCATOR” 


LD CO SE ME Total 
Psychologist BIL 49 29 26 45 
Educator 47 42 61 57 47 
No response 4 9 10 Wi 8 


research bear. Counselling and learning/development, however, might tend 
to attract more individuals whose primary interest is in psychology followed 
by its applications in an educational context. 


Conclusions 


The present survey examined educational psychologists employed in 
education faculties organized on a departmental basis only. A limitation 
this has placed upon the sample is the consequent absence of Francophone 
psychologists from the majority of Quebec universities. 


Schmidt’s observation that educational psychologists do not know 
whether to identify as psychologists or as educators is not strictly correct. 
In actual fact only 8% of the present sample were unable to make the choice. 
It is more accurate to say that about half identify as psychologists and half 
as educators. The significance of this fact becomes apparent when one 
considers the diversity of professional interests and activities represented 
by educational psychologists as a whole. Quite clearly there is no single 
factor which brings them together other than a common concern with the 
application of psychological principles to education. 


Virtually every area of psychology has had some application in 
educational settings and has potential educational significance. Thus the 
once popular definition of educational psychology as “that part of 
psychology that teachers ought to know” has little meaning (Nixon, 1976). 
Rather, educational psychology is better understood as a process rather 
than a content area. Psychology becomes educational psychology when its 
principles are applied to educational problems, and this is the function of 
educational psychologists. 
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KEVIN MARJORIBANKS 
University of Adelaide 


Personality and Environmental Correlates of 
Cognitive Performance and School Related 
Affective Characteristics: A Regression Surface 
Analysis 


Regression surface analysis was used to examine relationships between 
children’s personality and measures of cognitive and school related 
affective characteristics at different levels of family and_ school 
environments. Data were collected on 250 12-year-old children and their 
parents. Regression surfaces were constructed from models which 
examined possible linear, curvilinear and interaction relations between the 
variables. The Jackknife technique was used to adjust the significance 
levels in the analysis. Results indicate that there are significant sex 
differences in the relations between personality and measures of cognitive 
and affective characteristics at different environment levels. A number of 
general propositions are suggested, for example: that changes in 
personality are associated with changes in the intelligence test scores of 
girls at different levels of the family environment, but not related for boys; 
and that at each level of personality, increases in the favourableness of 
family environments are related to increases in the intelligence test scores 
for boys, but not for girls. A set of propositions are suggested for the 
relations between personality, family and school environments, and the 
affective characteristics of children. (Dr. Marjoribanks is a Professor in the 
Department of Education at the University of Adelaide, South Australia.) 


Although there have been many investigations into the relationships 
between the personality of children who are of late elementary or early 
secondary school age and measures of cognitive performance and school 
related affective characteristics, the findings remain inconsistent and 
inconclusive. For example, studies of British children by Jones (1960), 
Rushton (1966), Savage (1966), Ridding (1967), Eysenck and Cookson (1969), 
and Savage and Savage (1973) suggest that the extraverted child tends to 
perform better than the introverted child on measures of cognitive 
performance. The finding has been supported in Uganda by Honess and 
Kline (1974) and in a study by Orpen (1976) of Xhosa-speaking Blacks and 
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Afrikaans-speaking Whites in South Africa. Entwistle and Welsh (1969) 
found that among bright boys, extraversion was negatively related to 
academic attainment but for low ability boys, the relation was positive, 
while Entwistle and Cunningham (1968) found that girls who were stable 
extraverts and boys who were stable introverts formed superior academic 
groups. Frost (1968) proposes that for girls there is a positive relationship 
between extraversion and attainment but for boys the relation is negative 
which has the effect of producing an overall U-shaped relationship between 
extraversion and attainment. No significant relations between extraversion 
and scores on Raven’s Progressive Matrices are reported by Orme (1975) 
while Jensen (1973) found that, while extraversion had negligible 
correlations with intelligence, it had low but significant correlations with 
the academic achievement scores of white, Negro, and Mexican-American 
children. A high correlation between extraversion and reading age was 
found by Elliott (1972) but there was a negligible association between 
extraversion and intelligence when mental age and reading age were held 
constant. 


For the relation between neuroticism and cognitive performance, Savage 
(1966), Ridding (1967), Honess and Kline (1974) found no association while 
Callard and Goodfellow (1962), Rushton (1966), Entwistle and Cunningham 
(1968), and Jensen (1973) found significant negative linear correlations. 
Eysenck and Cookson (1969) propose that the relation between neuroticism 
and performance scores is negative with the regression being slightly 
curvilinear while Brown (1970) found a definite curvilinear relation. But 
Entwistle and Cunningham (1968) conclude that the relationship between 
neuroticism and attainment is linear and that the hypothesis of a nonlinear 
relation must be rejected. 

Studies relating personality and other school related affective measures 
are less numerous than those linking personality and cognitive 
performance. Instead of examining relations between personality and 
affective measures, studies generally use both sets of measures as predictors 
of achievement performance (e.g., see Cattell, Sealy & Sweney, 1966; Cattell 
& Butcher, 1968; Radcliffe, 1972). In a study of English secondary school 
children, Banks and Finlayson (1973) found a weak relationship supporting 
the proposition that, compared with extravert children, introverts have 
more interest in books and intellectual problems and are more able and 
willing to persist in working at an academic task. Using the Cattell 
Personality Questionnaire (HSPQ), Josephs and Smithers (1975) found that 
students who had more positive attitudes to school tended to be more 
conservative (Q,), controlled (Q3), conscientious and persistent (G), shy and 
sober (H), practical (M), somewhat more sober (F), more group dependent 
(Q>) and more apprehensive (O). As the authors suggest, no clear picture 
emerges in the relation between the constructs of extraversion and 
neuroticism and school related attitudes. Jackson and Getzels (1959) 
conclude that school related attitudes are a reflection of basic personality 
variables and are unrelated to intellectual ability and academic 
achievement while Williams (1970) found that students with positive school 
related affective measures obtained significantly higher scores on a set of 
ability, achievement, and personality variables. 

The findings in the research remain equivocal partly because the studies 
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have relied on the use of restricted statistical techniques such as 
product-moment correlations which reveal only bivariate relations and 
analysis of variance techniques which require the grouping of variables into 
levels. Simple additive multiple regression equations have been used to 
examine how much extra variance in cognitive and affective scores is 
associated with the addition of personality measures to the equations. The 
results generated from these equations depend on the order in which 
personality measures are added to the equations. All of these statistical 
techniques have prevented the full character of the relationships between 
personality and measures of affective characteristics and cognitive 
performance from being revealed. 


The inconsistencies in the findings are related also to the failure of most 
of the studies to examine the relations between personality and measures of 
cognitive performance and_ affective characteristics at different 
environment levels. As Lavin (1965, p. 100) proposes, the ‘disappointing 
state of affairs’ in the research may be due to the circumstance that 
virtually all studies “conceive of the individual as if he were operating in a 
social vacuum.” Similarly, Getzels (1969) suggests that research relating 
personality characteristics and achievement “might be more powerful if the 
social setting in which educational performance takes place where 
conceptualized and used as a significant variable with which personality 
interacts” (p. 100). In the present study an attempt is made to overcome 
some of the conceptual and statistical limitations of previous research. 
Regression surfaces have been plotted to investigate relations between 
personality and measures of cognitive performance and school related 
affective characteristics, at different levels of family and school learning 
environments. 


Method 

Sample 

Approximately 550 Australian 12-year-old children were tested to obtain 
measures of personality, school related affective characteristics, perceptions 
of classroom and school learning environments and intelligence. Family 
environment data were collected from interviews with the parents of 250 
children, 130 boys and 120 girls. Where possible two parallel pools of 
families, based on social status background, were formed. The purpose of 
the substitute pool was to provide a set of alternate families which could be 
used in the study if families from the first pool did not agree to participate. 
The final sample included 250 children and their parents. 


Measures 


Family environment. From a set of studies which had examined the 
relations between the environments of families and the cognitive and 
affective characteristics of children (e.g., Rosen, 1959; Dave, 1963; Wolf, 
1964; Plowden, 1967; Weiss, 1974; Marjoribanks, 1972; Walberg & 
Marjoribanks, 1973), a set of six environmental process variables were 
identified. These variables were labeled as parents’: expectations for the 
child, expectations for themselves, concern for the use of language within 
the family, reinforcement of educational expectations, knowledge of child’s 
educational progress, and the family involvement in educational activities. 
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The process variables provided the framework for the construction of an 
environmental measure which was an adaptation of a previous instrument 
(see Marjoribanks, 1972). The measure which was in the form of a 
semistructured home interview schedule was used to elicit responses from 
parents. For each item a set of alternate responses was supplied to the 
interviewer. In addition, an “other answer” space was provided so that the 
interviewer could record a response which was not covered by those 
supplied. As well as obtaining a measure of the intensity of the present 
learning environment operating in the family, the schedule attempted to 
gain a measure of the cumulative nature of the environment. For example, 
as well as asking, “How much education do you expect the child to 
recelve?’’, the schedule also included a question to estimate how long the 
expectations had been held. The scales had theta reliability estimates 
greater than .80 (see Armor, 1974). Scores on the scales were analyzed using 
principal component analysis and from the six environment scales, two 
environment factors were identified which were labeled: parents’ press for 
achievement (loaded on parents’ expectations for the child, expectations for 
themselves, concern for the use of language in the home, reinforcement of 
expectations) and parent-child involvement (loaded on knowledge of child’s 
educational progress and the family involvement in educational activities). 
Factor scores were obtained on the two family environment factors for each 
child. 


School Environment. A new school environment schedule was developed 
by selecting items from instruments constructed by Stern (1963), Walberg 
(1972), and Trickett and Moos (1973). The items which were selected had 
been shown to have moderate to high predictive validities in relation to 
cognitive and affective measures. The new scale is of a Likert-type format 
consisting of 50 items which attempt to assess children’s perceptions of 
classroom and school learning environments. Scores on the items were 
analyzed using principal component analysis. After eliminating items with 
small factor loadings (<.40) the remaining items were refactored, 
producing two factor scales. The scales which were labeled the intellectual 
orientation of the school and the punitive nature of the school environment 
both had theta reliability estimates of approximately .85. 


The intellectual orientation scale consisted of 14 items which were of the 
form: new ideas are met with immediate enthusiasm in this school, teachers 
here put a lot of energy and enthusiasm into their teaching, teachers 
encourage students to think about exciting and unusual careers, and in this 
school teachers go out of their way to help you. Included in the punitive 
environment scale were 16 items of the form: students in this school are 
sometimes punished without knowing the reason for it, in this school 
teachers very often make you feel like a child, students are often made to 
take the blame for things whether they did them or not, those in charge of 
this school are not very patient with students, and sometimes teachers 
embarrass students in my classes for not knowing the right answers to 
questions. An intellectual orientation of the school and a punitive school 
environment score for each child was obtained by summing the scores on 
the items which made up the two factors. 


Cognitive Performance. The general intellectual ability of the children 
was assessed using the Otis Intermediate Test - Form AB, which has 
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acceptable reliability estimates and high predictive validities in relation to 
other academic achievement measures. 


Personality. The CPQ, Form A devised by Cattell (see Cattell, Eber & 
Tatsuoka, 1970) was used to assess children’s personality. Data on 14 
personality constructs are provided by the schedule. When the scores on the 
variables were factor analyzed and the factors rotated, four second stratum 
personality factors were identified. There is a modest correspondence 
between these four factors and second stratum factors isolated in previous 
studies (e.g., see Gorsuch & Cattell, 1967; Warburton, 1972; Nesselroade & 
Baltes, 1975; Howarth, 1976). The factors were labeled: 
contemplative-impatience (loaded strongly on conscientious [G‘* |, dependent 
[I*], apprehensive [O*], controlled [Q,°]); self-reliant-subduedness 
(excitability [D*], assertive [E*], circumspect individualism [J* ], shrewdness 
[N*], high ergic tension [Q,°], enthusiastic [F’]); adjustment-anxiety 
(participating [A*], emotionally stable [C*], self-assured [O7], adventurous 
[H*]); and extraversion-introversion (aggressive [E*], enthusiastic [F’], 
worldly [N’]). Factor scores were obtained on the four second stratum 
personality factors for each child. 


School Related Affective Characteristics. Measures constructed by 
Barker Lunn (1969, 1970) and Summer (1972) and modified for the present 
study were used to assess children’s school related affective characteristics. 
Both scales are of a Likert-type format and each consists of 50 items. Factor 
scaling techniques were used to examine the responses to the two schedules. 
From the first schedule five factor scales were isolated which were identified 
as: enthusiasm for school, academic self-concept, loneliness at school, 
enthusiasm for disruptive behaviour, and children’s educational and 
occupational expectations. From the second schedule, two factors were 
identified and labeled: alienation from school and child’s academic 
orientation. The scales had theta reliabilities ranging from .70 to .87. From 
the seven affective factors, two second stratum factors were isolated and 
named: child’s commitment to school (loaded on enthusiasm for school’ , 
enthusiasm for disruptive behaviour , educational and occupational 
expectations’ , alienation from school , academic orientation*) and school 
self-confidence (academic self-concept*, loneliness at school”). Factor scores 
were obtained for each child on the two second stratum school related 
affective factors. 


Therefore the study examined relations between personality 
(contemplative-impatience, self-reliant-subduedness, adjustment-anxiety, 
extraversion-introversion) and measures of intelligence and school related 
affective characteristics (commitment to school, school self-confidence) at 
different levels of the family environment (parents’ press for achievement, 
parent-child involvement) and school environment (intellectual orientation 
of the school, punitive nature of school environment). 


Results 
The first investigation of the relations between the variables involved an 
analysis of zero-order correlations. In Table 1, the results show that there 
are sex differences in the relationships between the personality and 
environment measures and the intelligence and affective scores. For 
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TABLE 1 
ZERO-ORDER CORRELATIONS BETWEEN PERSONALITY, ENVIRONMENT 
AND MEASURES OF INTELLIGENCE AND SCHOOL RELATED 
AFFECTIVE CHARACTERISTICS 


Affective Characteristics 


Intelligence Commitment Self-Confidence 
to School in School 
Personality 
Contemplative -04 (a3) 30** (49%*) -15 (-06) 
Self-Reliant onl (21*) -45%** (-21%) Soe (-05) 
Adjustment 08 (33**) 22kk = (33**) 65** (53**) 
Extraversion 2 (32%**) -28%* (-20%*) 17* (22%*) 
Family Environment 
Parents’ Press for Achievement Sikes — (IL 7/)) 32** (09) 17* (08) 
Parent-Child Involvement 35%** (-15) 27%* (-04) 7s (-22*) 
School Environment 
Intellectual Orientation of 
Seca -16 (-07) 38%** (40**) 15 (-02) 
ear ee 
Non-Punitive Nature of 01 (10) 4gek (37%) 28k (20%) 


School 


Note. Decimal points have been omitted and coefficients for girls 
are in parentheses. 


- Pp < .05 
ee O01 


example, while the boys’ personality scores are not related to intelligence, 
three of the personality factors have significant low to moderate 
relationships with girls’ intelligence test scores. Also, the family 
environments of boys have moderate concurrent validities in relation to 
boys’ affective and intelligence scores while for girls only one relationship 
reaches statistical significance. The zero-order correlations suggest that 
children who tend to be contemplative, subdued, adjusted, and introvert 
have a greater commitment to school than children who are more patient, 
self-reliant, anxious, and extravert. As might be expected, children who 
express a positive commitment to school perceive the school as having an 
intellectual orientation and being nonpunitive. Also, children who indicate 
a high self-confidence in school tend to be well adjusted, extravert, and 
perceive the school environment to be nonpunitive. 


Relationships between the measures were investigated further by 
plotting regression surfaces using raw regression weights generated from 
regression models which included terms to account for possible linear, 
curvilinear, and interaction relations. Because prior research has produced 
conflicting evidence regarding sex differences in relations between 
personality and measures of intelligence and affective characteristics, and 
as the zero-order correlations suggested some sex differences in the 
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TABLE 2 


RAW REGRESSION WEIGHTS FOR MULTIPLE REGRESSION OF 
INTELLIGENCE AND SCHOOL RELATED AFFECTIVE CHARACTERISTICS 
ON PERSONALITY AND FAMILY ENVIRONMENT 


Criterion Variables 


Commitment Self-Confidence 

Predictor Variables Intelligence to School in School 
Parents' Press for 

Achievement 3.760** (1.450) 362** (.098) 094 (.048) 
Contemplative .848 (.885) 2515** (.465%**) 144 (-.028) 
Multiple R 39 (225) 743 (50) 5 Pe (2 1Me)) 
Press for Achievement Bn f47s* (e427) BS we (ell 2) - 084 (.048) 
Self-Reliant 298 G2625) 600** (-.185*) 137% (~.021) 
Multiple R 39 G27) 54 (aa 26 (.10) 
Press for Achievement Be 75 9%% C410) 385** (.098) . LOO* (.042) 
Adjustment Moles 5) (2.791%*) -474% (.400**) .666%* (.308*%*) 
Multiple R -40 (337) 739 (34) .67 (G55) 
Press for Achievement 3.648** (1.662) 420** (.091) 080 (COs) 
Extraversion IE toy} (4.494%*) .887** (-.365) Dit © 233) 
Multiple R .39 (.38) 45 (®) pig (.24) 
Parent-Child Involvement 6.024** (-2.406) Seeales (-.195) 164 (-. 206) 
Contemplative 866 (1.047) 516** (.478%**) 5 ALAS (-.015) 
Multiple R .36 (22) 39 (.50) A ase 
Parent-Child Involvement 6.041** (-2.321) 465* (~.047) 136 (-. 208%) 
Self-Reliant - 480 (1.345*) 490** (-.182*) .133** (-.015) 
Multiple R 35 Gan 50 (> la £25 (Eee) 
Parent-Child Involvement 6.033** (-1.950) 571** (-.050) . 180% (-.190*) 
Adjustment 1.700 (2.771%*) -478%* (.400** ) .668** (.304%*) 
Multiple R oe! G36) 535) (6 S29) Aa7) (E57) 
Parent-Child Involvement 6.026** (-1.771) 539%** (-.106) .160 (-.193*) 
Extraversion DVT (4.164%**) 760** (-.383%) ONE (.191%*) 
Multiple R . 38 ou, 39 (20) ake (.30) 

Note. Figures for girls are in parentheses. 


* 
*&* 


Multiple R is not significant at 


-05 level. 


Value of raw regression weight exceeds twice its standard error. 
Value of raw regression weight exceeds three times its standard error. 


relations, the regression surface analysis was conducted separately for girls 


and boys. 


In the study, a simple random sample was not used. Therefore, in the 


regression analysis the Design Effect for each raw regression weight was 
estimated using the Jackknife technique (Mosteller & Tukey, 1968; Finifter, 
1972). The significance levels for the regression weights were recalculated 
using the formula: standard error for sample estimate = (Design Effect)” 
simple random sample standard error (Kish, 1965; Ross, 1976). The 
Jackknife estimates led to a reduction in the number of regression weights 
that were significant. Therefore, a second stage of the regression analysis 
was conducted in which variables that no longer had significant relations 
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TABLE 3 


RAW REGRESSION WEIGHTS FOR MULTIPLE REGRESSION OF 
INTELLIGENCE AND SCHOOL RELATED AFFECTIVE CHARACTERISTICS 
ON PERSONALITY AND SCHOOL ENVIRONMENT 


Criterion Variables 


Commitment Self-Confidence 

Predictor Variables Intelligence to School in School 
Non-Punitive School 

Environment 029 (6dtO7)) ~127** (.061**) -O44** (.027%) 
Contemplative +605 (.782) 382% (.397%**) -.193% (~.058) 
Multiple R .04 (Gaul 5») 4 5)8) (54) Bee) G28) 
Non-Punitive Environment 018 (5 267) -106** (.082%**) .032% (1023) 
Self-Reliant 5 OS (0s) -—.442** (-.114) -.087 (001) 
Multiple R nice (.26) 57 (.39) .30 C219) 4 
Non-Punitive Environment -.023 (s057)) Ses 2 eee CO 754) .024* (sila) 
Adjustment 1.495 (2.765%) .298 (.328%**) -632%* (.298%*) 
Multiple R 08 (.33) 50 (.46) 67 (.54) 
Non-Punitive Environment plead; (@238)) -125** (.085**) sO51%*5 (027%) 
Extraversion 3.145 (4.549%**) -,382 (-. 289) oclsx (233%) 
Multiple R lS (435) 0) (.400) 38 (532) 
Intellectual Orientation 

of School -.498 (-. 250) sess (C50) 52) 029 (.001) 
Contemplative -.311 (akg LS») -478%* (.384**) ~.152 (-.027) 
Multiple R silks (te) 46 (155) 522 (.06) 
Intellectual Orientation -.5(@9 (-.040) pLIS**e (GE w03*%) 022 (-.004) 
Self-Reliant -.162 (Cle 2S 72) -.568** (-,111) -.131%* (~.023) 
Multiple R sage 2) 55 (.42) 25 (.06) 
Intellectual Orientation —-.517 (-.292) ~129%** (.096**) HORM (-.019) 
Adjustment 1.608 (3.084%*) -420%* (5 SSeS) -655** (.326%**) 
Multiple R .19 (. 36) s43 Ca) .66 C455) 
Intellectual Orientation -.440 (-.008) SlLUSs <a Co Los**) .037* (.004) 
Extraversion 1.825 (4.300%*) -.533% (-.232) 298% (.212%) 
Multiple R 119 (.32) 42 (.41) 26 (9) 


Note. Figures for girls are in parentheses. 


* Value of raw regression weight exceeds twice its standard error. 
** Value of raw regression weight exceeds three times its standard error. 


= Multiple R not significant at .05 level. 


with the measures of intelligence and affective characteristics were deleted 
from the regression models. In the second analysis, the Jackknife technique 
was used again to estimate Design Effects and thus to adjust further the 
significance levels of the regression weights. 


Regression surfaces were generated from equations including the raw 
regression weights shown in Tables 2 and 3. It was found that the two-term 
equations accounted for as much significant variance in the intelligence 
and affective measures as complex many-term equations containing 
quadratic terms (to test for nonlinearity) and product terms (to test for 
interactions). In these more complex models, the raw regression weights 
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between the intelligence and affective scores and the quadratic and product 
predictor terms were not significant after adjusting for Design Effects. 


As it is not possible in the space available to present all the regression 
surfaces that were generated from the data, four surfaces have been plotted 
in two figures. The surfaces reflect some of the different relations that are 
present among the variables. For the construction of the surface the scores 
on each variable were converted to standard scores with a mean of 50 anda 
standard deviation of 10. In Figure 1, the surfaces show the regression-fitted 
relations between the adjustment-anxiety personality factor and 
intelligence at different levels of parents’ press for achievement. The shape 
of the boys’ regression surface reflects those relations in Tables 2 and 3 in 
which the personality factors are not related to the intelligence and 
affective measures at different environment levels. The girls’ surface 
represents regression models in which the personality factors are associated 
with intelligence and the affective measures at different environment levels, 
but where the measures of intelligence and affective characteristics are not 
related to the environment scores at different levels of the personality 
factors. For example, in Figure 1 at each level of the adjustment-anxiety 
factor, increments in parents’ press for achievement for boys are associated 
with sizeable increases in the regression-estimated intelligence test scores. 
But at different levels of press for achievement, changes in boys’ 
personality from anxiety to adjustment are not associated with changes in 
intelligence scores. The girls’ surface shows a different set of relations. At 
each level of the anxiety-adjustment factor, increments in parents’ press for 
achievement are not associated with increments in intelligence. But at each 
family environment level, as the girls’ personality changes from anxiety to 
adjusted, there are increments in the regression-estimated intelligence 
scores. 


In Figure 2 the surfaces show the regression-fitted relations between the 
self-reliant-subduedness personality factor and commitment to school, at 


INTELLIGENCE 


repression 


girls! 
regression surface 
surface 

40 

high press 40 Adjustment 

for achievement 40 30 30 

low press Anxlety 
for achievement 
FAMILY ENV!RONMENT PERSONALITY FACTOR 
Figure 1. Fitted-intelligence scores in relation to the anxiety-adjustment factor 


and parents’ press for achievement. 
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COMMITMENT TO SCHOOL 


girls! 
regression 
surface 


boys! 
regress!lon40 
surface 


30 
60 Subduedness 


Intellectual Self Reliant 
Ortentation Non-Intellectual 
Ortentation 
SCHOOL ENVIRONMENT PERSONALITY FACTOR 
Figure 2. Fitted-commitment to school scores in relation to the 


self-reliant-subduedness factor and children’s perception of the 
intellectual orientation of the school. 


different levels of children’s perception of the intellectual orientation of the 
school environment. The boys’ surface reflects those regression equations in 
which intelligence or the affective scores are related to both personality 
factors and environment scores. The surface shows that boys who are 
self-reliant (excitable, assertive, circumspect individualistic, shrewd, 
enthusiastic, high ergic tension) and who perceive the school to have a 
non-intellectual orientation exhibit a low level of commitment to school. 
That is, they express little enthusiasm for school, enthusiasm for disruptive 
behaviour, low educational and occupational expectations, alienation from 
school, and low academic orientation. If self-reliant boys perceive the school 
as having an intellectual orientation, their commitment to school increases. 
At each level of boys’ perception of the intellectual orientation of the school, 
changes in the personality factor from self-reliant to subduedness are 
associated with a more positive commitment to school. But even at a high 
level of subduedness, if boys perceive the school to have a non-intellectual 
orientation, then commitment to school remains relatively depressed. 
Again, the girls’ surface indicates the sex-related differences in the 
associations among the _ variables. At each level of the 
self-reliant-subduedness factor, changes in the girls’ perceptions of the 
school environment are associated with changes in their commitment to 
school, which is a relation similar to that of the boys. But at each 
environment level, changes in girls’ personality from self-rehant to 
subduedness are not related to changes in their commitment to school. 


Discussion 
By using regression models to generate regression surfaces and by 
including measures of family and school environments, the present study 
has gone beyond much of previous research which has examined relations 
between personality and measures of cognitive performance and school 
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related affective characteristics. From the findings a number of general 
propositions are suggested. Because of the important sex differences that 
were present in the results, the propositions are stated separately for boys 
and girls. For boys the results suggest that: (a) at each level of family 
environment, changes in personality factors are not associated with 
changes in intelligence scores; (b) at each level of personality, increases in 
the favourableness of the family environment are related to increments in 
intelligence; (c) commitment to school is related to both personality and 
environment measures. That is, boys who (i) have family environments 
which exert a high press for achievement and exhibit a high amount of 
parent-child involvement and (ii) perceive the school environment to have 
an intellectual orientation and to be nonpunitive, may not necessarily 
express a higher commitment to schools than boys who: (i) have families 
exerting little press for achievement and low parent-child involvement and 
(ii) perceive the school as having a non-intellectual orientation and as being 
punitive. But if the supportive family environment and the positive 
perceptions of school are associated with personality characteristics defined 
as contemplative, subdued, adjusted and introvert, boys are likely to 
indicate a high commitment to school; and (d) boys who perceive the school 
as being a nonpunitive environment and who are extravert, impatient, and 
adjusted indicate a high self-confidence in school while boys who perceive 
the school environment as being punitive and who are introvert, 
contemplative, and anxious exhibit low self-confidence in school. 


For girls, the findings suggest the general propositions that: (a) at each 
level of family environment, changes in personality are associated with 
changes in intelligence test scores; (b) at each level of personality, changes 
in the family environment are not associated with changes in intelligence 
scores; (c) commitment to school is related to the personality factors and 
perceptions of the school environment but not to the family environment. In 
general, at any level of the family environment a high commitment to 
school is associated with girls who are contemplative, well adjusted, and 
introvert. Also, girls who are well adjusted and contemplative and who 
perceive the school as being nonpunitive and intellectually oriented have a 
very positive commitment to school, while girls who are anxious and 
impatient and who perceive the school as being punitive and 
non-intellectually oriented have a relatively low commitment to school; and 
(d) girls who are well-adjusted rather than anxious have a higher level of 
self-confidence in school. 


The inclusion of environmental variables in the analysis of the relations 
between personality and measures of cognitive and affective characteristics 
helps to explain some of the inconsistencies in previous research, especially 
the equivocal findings that have been found for girls and boys. What is 
required now is a set of studies which investigate relationships between 
personality and other measures of cognitive performance and affective 
characteristics at different environment levels, for girls and boys of 
different age groups. Only when such research is completed will we have a 
real understanding of the complex nature of the relations between 
personality and measures of cognitive and affective characteristics. 
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Basic Dimensions in Characteristics 
of Classroom Groups 


A diverse battery of individual and group measures was obtained for 144 
high school physics classes. Several sets of factors were identified by factor 
analysis of the battery, one of which was a predetermined rotation based 
on Bales’ model of interpersonal behaviour in groups. The model-based 
rotation, comprising three factors named Achievement, Cohesiveness, and 
Status, compared favourably to principal components and Varimax 
rotation with respect to interpretability and variance accounted for. 
However, the roles played in the factor structure by measures of 
authoritarianism and interest in subject matter suggest that some 
modification of the Bales dimensions may be appropriate for describing 
classroom groups as wholes. (Dr. Ahigren is an Associate Professor of 
Curriculum and Instruction, Center for Educational Development, 
University of Minnesota, Twin Cities; Dr. Walberg is a Professor of 
Education, College of Education, University of Illinois at Chicago Circle.) 


Bales (1968) has described a three-dimensional model of interpersonal 
behaviour, which he derived from replicated factor-analytic studies of 
behaviour observations in small groups. An attempt is made in the present 
study to show that a very similar set of three dimensions is successful in 
characterizing groups as wholes. The data are class means, from a national 
sample of high school physics classes, on perceptions of classroom climate 
and on a variety of academic, biographical, and personality variables. The 
success of fitting the model to the data will be evaluated on the basis of (1) 
the model’s parsimony in accounting for variance in a large battery of class 
variables, and (2) the relatedness of the three hypothesized dimensions to 
other educational variables. 


In Bales’ studies, interpersonal interactions within small groups are 
observed and classified by nonparticipants. Three factors that represent 
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patterns of observations are computed for each individual in the group. 
Each individual in a group can be represented by a point inside a cube 
defined by these three dimensions. The up-down dimension Bales calls 
“power’’; it represents the degree of individual dominance, prominence, or 
status in the group. A person with high group status may be metaphorically 
understood: as being high in this vertical dimension, near the top of the 
cube. The left-right dimension is “affection”; acceptance and affiliation are 
associated with an individual located near the right side of the cube, while 
rejection and hostility are associated with an individual located near the 
left side. The forward-backward dimension is “achievement,” the extent to 
which the individual promotes group tasks and achieves group goals. A 
person who occupies a forward position conforms to group norms and helps 
the group to accomplish its tasks; a person in a backward position prevents 
the group from achieving. In Bales’ model, the relations between any two 
people in the group are determined in large part by their relative positions 
in the cube. 


An important aspect of the Bales model is the independence of the three 
dimensions: the position of a group member on one dimension should not be 
predictable from his or her position on the other dimensions. Thus, for 
example, a person who advances group goals may engender either 
cohesiveness or hostility in the group and may be either high or low in 
status. 


Bales cites evidence for the three dimensions in repeated factor analyses 
of the frequencies of categories of observed individual behaviour within 
small groups. Although Bales’ instruments have often been used within 
classroom groups, his model apparently has not been used to characterize 
groups as wholes. The present study uses as units of analysis the group 
means — specifically, the class means of a national sample of 144 high 
school physics classes (containing about 3,700 students). Needless to say, 
the correlation structure for group means may be quite unlike that for 
individuals within groups. 

The context of the present research, then, is fundamentally different 
from the work of Bales. His units of analysis were individuals in the same 
group who interacted with one another; and the intragroup measures were 
ipsative — high and low scores are relative to the scores of interacting 
members of the same group. The units of analysis in the present study were 
noninteractive and represent the pooled perceptions of all members of each 
group. Accordingly, the Bales dimensions shift somewhat in meaning when 
group means are considered. Status is not relative status within a group, but 
group emphasis on status, interpersonal affect becomes a generalized group 
affective tone, and achievement becomes group emphasis on achievement. 


(The teacher, by virtue of his special role, is not properly a perceiver of 
the learning climate. He or she is considered as an external constraint, 
rather than a member of the interacting learning group. Examination of the 
items comprised by each scale will confirm that the teacher is not 
considered in competition for “status,” or a receiver of “favouritism,” etc.) 


The present study also differs from the work of Bales in the 
incorporation of several diverse domains of measurement — personality, 
biographical, and academic. Substantial contributions of these concrete 
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variables in the factor patterns provide an economical way of 
demonstrating the pertinence of the dimensions of group perception to 
education processes. 


Method 
Sample Characteristics 


The opportunity for the study arose in the evaluation of a 
government-funded curriculum project, Harvard Project Physics. Data were 
collected from the current classes of two sets of teachers: (1) 36 teachers in a 
simple random sample from the National Science Teachers Association’s 
list of about 16,000 high school physics teachers in the continental U.S. who 
had agreed to teach a new course, Harvard Project Physics (for details of 
the sampling procedure, see Walberg, Welch & Ahlgren, 1968) and (2) 21 
teachers who had volunteered to teach the course in previous years. 
Teachers in both groups were more likely to teach in large, advantaged 
school districts (where there are more physics teachers). Mean IQ’s for 
students in both groups were 115, with a standard deviation of 15. The 
sample thus consisted almost entirely of students with above-average 
intelligence and the academic motivation to take a physics course, which is 
generally regarded as one of the most difficult subjects in high school. 
(What may be missing in group diversity may have been gained in 
conscientious responses to the instruments.) Many teachers had more than 
one Harvard Project Physics class, and the total number of classes in the 
study was 144. Class size ranged from 19 to 32, with a mean of 25. 


Group Measures 


The set of 57 variables is listed in outline in Table 1. The perceived class 
properties were the class means on the 14 scores of the Learning 
Environment Inventory, a measure of the social environment of learning. 
(See Walberg, 1969a, for a rationale of the instrument and validities for 


TABLE 1 

VARIABLES INCLUDED IN FACTOR ANALYSIS 
Learning Environment Personality Biographical Academic (Pre and Post) 
Intimacy Dogmatism Books in Home Cognitive 
Friction Authoritarianism Nonschool Reading Science Understanding 
Cliqueness Rigidity Nonschool Study Science Processes 
Satisfaction Need Achievement Opportunity for Creativity Physics Achievement 
Speed Need Order Career Discussion with Adults 
Difficulty Need Affiliation Like School Noncognitive 
Apathy Need Change School Marks Science Interest 
Favouritism Perfectionism Physics Activities 
Formality IQ Perseverance Physics Interest 
Goal Direction Importance of Intelligence 
Democracy Question Teachers 
Disorganization Creative Imagination 
Diversity Group Leader 
Environment Visual Artist 


Scientist 
Performing Artist 
Musician 
Writer 
Senior 
Plan College Entrance 
Class Size 
Class Size--squared 
Girl Ratio 
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TABLE 2 


EXAMPLE ITEMS FOR NINE SCORES FROM THE 
LEARNING ENVIRONMENT INVENTORY 


Score Example Item (from the 7 comprising the score) 
Friction "Students are constantly challenged." 
Cliqueness "Certain students work only with their close friends." 
Satisfaction "The students enjoy their class work." 
Speed "The pace of the class is rushed." 
Difficulty "Students are constantly challenged." 
Apathy "Failure of the class would mean nothing to most members." 
Favouritism "The better students are granted special privileges." 
Formality "There is a recognized right and wrong way of going 


about class activities. 


Democracy "Each member of the class has as much influence as any 
other member." 


predicting learning.) Seven items describing a class compose each score; for 
each item the student is asked to indicate (on a 4-point scale) his agreement 
or disagreement on how well a statement describes his class. The mean of 
the seven item ratings is the individual’s score, and the mean of the scores 
for the students in a class is the score for the class. The individual and class 
alpha reliabilities for the scores range from 0.6 to 0.8. Example items for 
most of the scores appear in Table 2. 


Personality Measures and IQ 


Class means on seven personality scores were used: three adapted from 
the work of Rokeach and others (1960) — Dogmatism, Authoritarianism, 
and Rigidity, and four from the Edwards Personal Preference Schedule 
(1959) — the needs for Achievement, Order, Affiliation, and Change. Items 
constituting the scores were reproduced in a booklet, and the students were 
asked to express agreement or disagreement with each on a 4-point scale. 
The individual alpha reliabilities of the scales range from 0.6 to 0.8. 

Mean class IQ was computed from individual scores on _ the 
Henmon-Nelson Test of Mental Ability (Form B), which has a 
Kuder-Richardson Formula 20 reliability of .91 as reported in the Manual. 


Biographical Inventory 

Twenty items were selected from an instrument developed by Taylor and 
Ellison (1967) to measure creative potential. Parent forms of the instrument 
showed substantial correlations (.4 to .6) with assessments of creativity 
such as peer and supervisor ratings. The multiple-choice items relate to 
components of the student’s home environment, school experience, and 
activities. 
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Academic Measures 

Three cognitive and three noncognitive academic variables were 
administered both as pretests and posttests. The first cognitive measure, 
Science Understanding, is the total score on the Test on Understanding 
Science (Cooley & Klopfer, 1961). The TOUS is a 60-item, multiple-choice 
test on the nature of science, science as an institution, and scientists as 
people; its Kuder-Richardson Formula 20 reliability is .76 as reported in the 
Manual. The second cognitive measure, Science Processes, is the total score 
on the Welch Science Process Inventory (Welch & Pella, 1967); the WSPIJis a 
135-item, multiple-choice test on the assumptions, activities, products, and 
ethics of science and has a reported KR 20 reliability of .86. The third 
cognitive measure, Physics Achievement, is the total score on Physics 
Achievement Test, a locally constructed, 36-item, multiple-choice test of 
general physics knowledge; the PAT had a KR 20 reliability of .77 for 
physics students in the random sample. 


The first noncognitive measure, Science Interest, is a subscore of the 
Academic Interest Measure (Halpern, 1965). It consists of items describing 
science activities, and the respondent is asked to indicate which he would 
like to do; the total score has a KR 20 reliability of .91. The second 
noncognitive measure, Physics Activities, is a report of voluntary 
participation in physics activities during the past year, a subtotal of the 
items having to do with physics on the Pupil Activity Inventory (Cooley & 
Reed, 1961); the split-half reliability with the Spearman-Brown correction is 
reported as .80. The third noncognitive measure is a composite of semantic 
differential ratings of the concept “Physics” obtained on the following 
scales (part of a 5-concept, 15-scale instrument): interesting/dull, 
exciting/boring, and stimulating/monotonous; the alpha reliability of the 
total of these three scales was .86 for physics students in the random 
sample. 


Miscellaneous Variables 


The class size and fraction of girls in each class were also included 
among the variables. Since prior work (Walberg, 1969a) showed class size to 
be quadratically related to group properties, both linear and squared 
class-size terms were entered in the factor analysis. 


The fourteen Learning Environment Inventory scores are prima facie 
group scores (and 12 of them proved to be prominent in the factors described 
below). Many of the other scores (cognitive, interest, and biographical) are 
not intrinsically group scores, but it has been shown elsewhere that they are 
characteristic of groups at least to the extent that the biographical variables 
are predictive of classroom climate (Walberg, 1969a, 1969b), and the 
cognitive and interest variables are predictable from classroom climate 
(Walberg & Ahlgren, 1970). Furthermore, the pretests and the biographical 
variables are, at least potentially, available to administrators for 
consideration in advising students to take courses or in actually assigning 
them to different classes; so class means on these variables are manipulable 
and therefore are pragmatically group properties. 
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Testing Procedure 


The teachers administered the criterion battery on two days during 
September and again during the following May. On the assumption that a 
few months are necessary for classes to mature into social groups, the 
Learning Environment Inventory was administered in November. It was 
assumed that IQ and personality are fairly stable, and these measures also 
were administered in November. The biographical items, also assumed to be 
stable, were administered in March. 


Although all students took the Biographical Inventory, random halves of 
the students in each class took all the other instruments. This sampling 
method, termed ‘randomized data collection” (Walberg & Welch, 1967) can 
be used to increase the number of instruments administered in limited class 
time. Class sets of instruments were prepared in which different 
instruments were ordered randomly; a set was distributed to the class row 
by row, seat by seat, providing a random sample of the students in the 
class. 


Analysis 

A principal components analysis was run on the 57 variables. 
Components larger than 10% of the total variance were rotated orthogonally 
using two methods, Varimax and fixed-position. The commonly used 
Varimax method is designed to simplify the columns of the factor-loading 
matrix so that each factor has a distinctive set of high-loading variables 
and negligible loadings on the rest of the variables. In the fixed-position 
rotation method, the program is designed to rotate the factors to fit as well 
as possible certain variables chosen by the investigator to represent 
hypothesized dimensions. Thus the fixed-position rotation method permits 
testing of a priori theoretical structures, in contrast to automatic 
simple-structure rotations such as Varimax (Couch, 1969; Guilford & 
Hoepfner, 1969). 

Table 3 lists the variable sets with which an attempt was made to elicit 
the three dimensions in the fixed rotation. For “status” the program was 
requested to place a factor through the environment variables Favouritism 
and (-) Democracy. (Sample items on which these variables are based were 
given in Table 2.) An attempt to elicit the “achievement” dimension was 
made by specifying the environment variables Speed and Difficulty and 
posttests on the cognitive academic variables Physics Achievement, Science 
Understanding, and Science Processes. For the “affect” factor, the 
environment variables Satisfaction, (-) Friction, (-) Cliqueness, and (-) 
Apathy were specified, as were the noncognitive academic variables Science 
Interest, Physics Interest, and Physics Activities. The sets of variables used 
to specify the achievement and affect factors have face appropriateness to 
those names, but the choices were based also on canonical correlations 
among environment and academic measures found in previous work 
(Walberg, 1969b). 

Clearly, both Varimax and fixed methods can be useful in different 
research. However, since fixed-position methods are relatively unfamiliar 
and untested, both Varimax and fixed rotations were employed in the 
present study to provide two checks on the model and to provide a 
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TABLE 3 
VARIABLES SPECIFIED TO DEFINE FACTORS IN FIXED ROTATION 


Theoretical Dimension Variables Specified 
"Status" Learning Environment 
Favouritism 


Democracy (-) 


"Achievement" Learning Environment 


Speed 
Difficulty 


Academic -- Posttests 


Science Understanding 
Science Processes 
Physics Achievement 


DATE eC tu Learning Environment 


Satisfaction 
Friction (-) 
Cliqueness (-) 
Apathy (-) 


Academic -- Posttests 


Science Interest 
Physics Activities 
Physics Interest 


comparison between the results of the two methods. Factor analysis by its 
nature identifies independent dimensions, but cannot assure that the 
dimensions are interpretable and meaningful. The central intent was to find 
whether the Bales dimensions provide a good fit to the variables, that is, 
whether they account for a large and interpretable amount of common 
variance. The “goodness of fit” must be relative to something; principal 
components give the maximum variance-accounted-for by a given number 
of factors, and Varimax is the most common rotation method for generating 
interpretable factors. If the fixed factors meet both criteria, that is, if they 
are found to account for nearly as much variance as the same number of 
principal components and if they are as conceptually reasonable as the 
Varimax factors, then the Bales dimensions might be deemed a good fit to 
the data. In interpretation of the factor structure, factor loadings were not to 
be considered if they were less than .40. 


Results 


The principal components analysis produced eighteen factors with latent 
roots greater than 1. The first six latent roots for the three factor sets are 
plotted, in order of size, in Figure 1. The abrupt change in slope that occurs 
after the third root for all three sets justifies consideration of only the first 
three factors. (The rationale for this criterion, whimsically called the “scree” 
test by Cattell, is described in Cattell, 1966, p. 206.) Table 4 displays the 
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Latent Root 


O Principal Components 
O Varimax Rotation 


@ Fixed Rotation 


Figure 1. Largest six latent roots for three factor sets plotted in order of size. 


loadings (of .40 or greater) for all 57 variables on the first three factors of 
each set. (The loadings on variables used to specify the three fixed factors 
are in italics.) Roughly equal amounts of common variance in the battery of 
variables are accounted for by each set of three factors, as can be seen in the 
bottom rows of Table 3: 45% for the three principal components, 36% for the 
Varimax factors, and 36% for the fixed factors. As is commonly found, there 
is a better balance of strength among the factors for the two rotations than 
for the principal components, but with slightly less variance accounted for. 


All three solutions yield roughly similar factors relating to achievement, 
to affect, and to group structure. Because there were no data common to 
Bales’ work and this study, it is not possible to express the match of these 
factors to his quantitatively. The appropriateness of the factors to the Bales 
model must be argued conceptually. 


Discussion 


The strongest factors in each of the three sets are obviously 
“achievement” factors and are rather similar. The second and third factors 
in the several sets are not so unambiguously identifiable as the first. For the 
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TABLE 4 
LOADINGS GREATER THAN 0.40 ON THREE DIFFERENT 
FACTOR SETS 
Principal Varimax Fixed 
Glas sete eke Components Rotation Rotation 
IE 10a Laie I II PED ie JEL alee 
Learning Environment 
Friction - -67 - - - -75 - -40 -66 
Cliqueness - -62 - = - -64 - -46 —49 
Satisfaction - 76 - - - 80 - 42 67 
Speed - - - - ~ = 58 = - 
Difficulty 43 - - 46 - - 76 - - 
Apathy - =85 = = - =i/ 7) - -58 -60 
Favouritism - -48 -50 - - -73 - - -85 
Goal Direction - 65 - - - 75 - ~ 55 
Democracy - BS) - - - Te) ~ - 81 
Disorganization - -65 =54 - - -82 - - -66 
Environment - - 42 - - 51 - - 49 
Personality 
Dogmatism -46 = = = = = a = = 
Authoritarianism -65 - - -49 ~ -47 - - 
Need Order = 4O A = = = = = = 
Need Affiliation = 50 = = is = = = fs, 
IQ 75 - - 80 - - 63 - - 
Biographical 
Books in Home 44 = = = = 2. = = & 
Nonschool Study - - = = 54 = = = = 
Career Discussion = 4l = = 4 = 2 = = 
Like School - 56 - - - - - 41 - 
School Marks 66 - - Sy) - - 50 - = 
Perfectionism om 40 = - = & = 2s = 
Perseverance 50 = = = = ca he = Z 
Creative Imagination 56 = = = = os = = = 
Scientist 51 - - = SY) - = 63 3 
Performing Artist - - -52 = ~ = = = = 
Writer 42 = = = = = = = = 
Plan College Entrance 56 - - Syl - - 43 - - 
Academic -- Pretests 
Science Understanding 1s - - WP - - By - - 
Physics Achievement 7133 ~ - 81 - - 60 - - 
Science Processes 69 - — Wil ~ - 61 - - 
Science Interest - - = = = = = 45 - 
Physics Activities 54 ~ = - - - - 45 - 
Physics Interest - - = = = = = = - 
Academic -- Posttests 
Science Understanding 72 - - 78 - = 76 = = 
Science Processes Tie - - 78 - - 74 - = 
Physics Achievement PP - 2 79 3 si 20 3 = 
Science Interest = 58 - - 82 - - 76 - 
Physics Activities 42 40 -43 ~ 77 = = 72 = 
Physics Interest = 54 - - 81 = - 78 = 
Girl Ratio - 49 - & = = = = = 
Latent Roots: 8.5 Ws8 3.9 6.4 Bia Sid 51.0 Ah 5.0 
76 Common Variance 19% i 9% IGA 8% SA 37, AUB 2% 
(in 18 factors) ———______ 
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characterization of factors below, it is useful to sort the variables on the 
basis of face content into three rough groups: “interpersonal affect,” which 
includes Friction, Satisfaction, and Apathy; “group structure,’ which 
includes Favouritism, Disorganization, Cliqueness, and Democracy; and 
“subject-matter interest,” which includes Physics Interest and Physics 
Activities. 


Principal Components 


The achievement factor (I) in principal components is clearly similar to 
Bales’ achievement dimension. It has strong loadings by both pretests and 
posttests for the three cognitive measures and Physics Activities, by 
perceived class Difficulty, by IQ, and by self-ratings of School Marks, 
Perseverance, Creative Imagination, Scientist, and Plan College Entrance. 

There is, however, one point of inconsistency with the Bales model: the 
Achievement factor has strong negative loadings by Authoritarianism and 
Dogmatism. The negative loading by Authoritarianism is a distinctive 
departure from the intragroup relationship envisioned for it by Bales. Bales 
(1970, p. 395) reports that the factor analysis is ambiguous about 
authoritarianism (it is unrelated to one component of group achievement 
and has a weak positive relation to another), but that it ought properly to be 
associated positively with achievement because commitment to group goals 
implies acceptance of authority. This discrepancy between results of this 
study and Bales’ prediction may be due in part to the restricted variance 
and special nature of the group members in the present study — physics 
students with a high mean IQ. It might be, for example, that among these 
generally high-achievement students authoritarianism is inconsistent with 
understanding science. (It is interesting to note that the correlation between 
group means on Authoritarianism and IQ is -.49.) 

Principal component II includes interpersonal affect variables (e.g., 
Friction, Apathy), and group structure variables (e.g., Democracy, 
Disorganization), some personality variables (e.g., Need Affiliation), some 
biographical variables (e.g., Like School), interest posttests (e.g., Science 
Interest), and Girl Ratio, and so has been labelled “General Affect.” 

Principal component III shows loadings primarily by group structure 
variables (which it shares with principal component II), and on self-rating 
of Performing Artist. This combination suggests that the factor may be 
reflecting classes in which favouritism falling on high-status performers 
leads to some disruption of social structure; on the basis of this speculation, 
principal component III is labelled “Status Disruption.” 


Varimax Rotation 

As is expected in a Varimax rotation, there is a cleaner separation of the 
variables on the Varimax factors than there is on the principal components. 
The Varimax factor I is very similar to principal component I, but has lower 
loadings by Physics Activities and self-ratings of School Marks, 
Perseverance, Creative Imagination, Scientist, and Writer; so it is still 
labelled “Achievement.” The Varimax factor II is loaded almost exclusively 
by posttest subject-matter interest and activities and by self-rating of 
Scientist and so has been labelled “Science Interest.” 
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The Varimax factor III is very similar to the “Group Affect” principal 
component II; it is somewhat more strongly loaded both by group structure 
variables and by interpersonal affect variables, and lacks loadings by 
personality, biographical, or interest variables. It is therefore more clearly a 
group variable and is labelled simply “Social.” 

Neither the principal components Achievement, General Affect, and 
Status Disruption, nor the Varimax factors Achievement, Science Interest, 
and Social, are entirely commensurate with Bales’ achievement, affect, and 
status dimensions. The success of the fixed rotation, however, indicates that 
a more satisfactory match is possible. 


Fixed Factors 

The hypothesized status and achievement fixed factors are clearly 
dominated by the variables through which the program forced them. 
(Achievement shows additional high loadings by pretest achievement 
measures, as well as by the specified posttests.) The hypothesized affect 
factor, however, does not have exclusive loadings by four of its six specified 
variables — Friction, Cliqueness, Satisfaction and Apathy; in fact, these 
loaded more strongly on the status factor. A neat sorting of variables is of 
course not necessary for a satisfactory fit, as long as the dual contribution 
of some variables can be interpreted sensibly. 


Compared to principal component I, the fixed factor I, like Varimax I, is 
loaded more clearly: by cognitive tests and less by interest measures. In 
addition, however, the fixed factor I has weaker loadings by cognitive 
pretests, previous grades, and IQ and has (as we intended) distinctly 
stronger loadings by class Speed and Difficulty. Thus the fixed factor I 
represents a more socialized and active dimension — what happens in the 
class rather than what is brought to it — and therefore seems more 
appropriate to a group characteristic. To suggest this socialization and 
activity, we label it ““Group Achievement.”’ 

The fixed factor II is loaded almost as highly as the “Science Interest” 
Varimax factor II by subject-matter interest but in addition includes (as 
intended) substantial loadings by interpersonal variables such as (-) 
Friction, (-) Cliqueness, Satisfaction and (-) Apathy and (incidentally) a 
higher loading by self-rating of Scientist. Since it represents interpersonal 
feeling plus common interest in school and science, fixed factor II has been 
labelled “Cohesiveness.” 


The fixed factor III is similar to the “Social” Varimax factor III, but is 
loaded more heavily (as intended) by the variables Democracy and (-) 
Favouritism, and (incidentally) less heavily by the variables (-) Apathy, (-) 
Friction, (-) Cliqueness, Satisfaction and (-) Disorganization. For the most 
part, the group structure has been emphasized at the expense of the 
interpersonal affect, so the factor is labelled simply “Status.” 

Qualitatively, then, the fixed factors seem to constitute a good match to 
Bales’ model, with the main exception being a component of subject-matter 
interest in the affect dimension. The deviation may well be due to additional 
information in the particular array of variables in this study, rather than to 


an intrinsic inappropriateness of the Bales model for characterizing 
classroom groups. 
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It has seemed appropriate to describe the principal components as 
Achievement, Group Affect, and Status Disruption, to describe the Varimax 
factors as Achievement, Science Interest, and Social, and to describe the 
fixed factors as Group Achievement, Cohesiveness, and Status. It is 
tempting to characterize these different organizations of the variables as 
three traditional views of the classroom: (a) the principal components 
suggest the psychologists’ division of cognition, affect, and group structure, 
(b) Varimax rotation suggests the educators’ division of achievement, 
motivation, and socialization; and (c) the fixed rotation suggests the social 
psychologists’ (and, in particular, Bales’) division of group progress, group 
feeling, and group status. This characterization is probably too glib, but it is 
important to remember that there need not be any “true” dimensions of 
group characteristics. As is often the case for science in general, different 
models can be abstracted with equal success from the same phenomena; the 
choice of model depends on one’s viewpoint and intentions, and ultimately 
the value of a choice is judged by its fruitfulness in dealing with new 
phenomena. 


Conclusions 


Using 144 high school physics classes as units of analysis, class means 
on a battery of 57 variables were factored in three different factor sets — 
principal components, an orthogonal Varimax rotation, and an orthogonal 
fixed rotation based on Bales’ dimensions of interpersonal interaction in 
groups. Each set included three main factors which could be identified 
roughly as achievement, affect, and group structure. 


The factor set designed to match the Bales model accounts for nearly as 
much variance in the battery as principal components and Varimax factor 
sets do, indicating that fixed factors represent a legitimate way of laying 
out the characteristics of groups. 

The fixed factors Group Achievement and Status are clearly appropriate 
to Bales’ achievement and status dimensions; the fixed factor Cohesiveness 
is very similar to Bales’ affect dimension, with the addition of interest in the 
class subject matter. It may be just this modification which will most 
fruitfully extend the model into the context of whole, uncontrived groups. 

The unexpected negative loading for Authoritarianism on all of the 
achievement factors may be due to the specialized nature of the groups 
studied, but the speculation is offered that it may arise from inconsistency 
between the formal goals of a group and goals generated by intragroup 
interaction. 

In light of the many differences between Bales’ work and the present 
study, both in instruments and in units of analysis, the general success of 
the fit gives further credibility to the Bales dimensions and to their general 
usefulness in analysis of groups. 
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Classical Conditioning Procedures 


The purpose of the study was to test the hypothesis that positively 
evaluated words, when paired with emotionally ambiguous - slide 
photographs of Japanese and East Indian Canadians, would bring about a 
positive change in attitudes toward these two groups. Lower mainland 
(British Columbia) attitudes toward Japanese Canadians are positive and 
attitudes toward East Indian Canadians are negative. It was anticipated, 
on the basis of an earlier study, that attitudes toward the Japanese 
Canadians would change more significantly than attitudes toward East 
Indian Canadians. 

Pretest means of attitudes toward Japanese Canadians and East 
Indian Canadians showed significantly more positive attitudes toward 
Japanese Canadians. The East Indian slides associated with positive 
words resulted in significant treatment effects. There were no other 
treatment effects. (Dr. Kehoe is Associate Professor and Dr. Echols is 
Assistant Professor in the Faculty of Education at the University of British 
Columbia; Dr. Stone is Research Associate at the Atlantic Institute of 
Education, Halifax.) 


The purpose of this study was to test the hypothesis that positively 
evaluated words, when paired with emotionally ambiguous slide 
photographs of Japanese and East Indian Canadians, would bring about a 
positive change in attitudes toward these two groups. The procedure 
employed is a replication of a study by Parish and Lambert (1973) in which 
Vietnamese and Negroes were the target groups of attitude change. 
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Selection of Japanese and East Indian Canadian ethnic groups was 
thought to parallel the situations of the groups used in Parish and 
Lambert’s study. 


Japanese Canadians have been in the lower mainland of British 
Columbia for quite some time and now enjoy a high degree of integration. 
Their ‘neutral position” was thought to be similar to that of the 
Vietnamese. East Indian Canadians have been recently experiencing 
difficulties in integration, and are thus suffering a more accentuated profile. 
This situation seemed to parallel that of the Negroes in the Parish and 
Lambert study. 


The subjects in the Parish and Lambert (1973) investigation were from 
four classes with 92 Caucasian sixth-grade males and females randomly 
assigned to three groups. Prior to the treatments, all students were 
administered a revised semantic-differential rating instrument developed by 
Osgood, Suci, and Tannenbaum (1957). Group One was then shown 21 
emotionally ambiguous slide photographs of Negroes, each one being 
followed by a slide photograph of a positively evaluated word and by a 
blank slide. They were then shown 12 emotionally ambiguous slide 
photographs of Vietnamese, each followed by a slide photograph of a 
neutrally evaluated word, and a blank. Group Two was then given a second 
treatment in which the word order was reversed, i.e., positive words with 
Vietnamese and neutral words with Negroes. Group Three was the control 
group. Each group was given the same pretest instrument as a posttest. 


Results of analyses of covariance showed that no significant (.05) change 
in attitudes toward Negroes occurred in those subjects who had seen the 
positively evaluated words associated with the slides of Negroes. There was, 
however, a significant (.05) change in positive attitudes toward Vietnamese 
in those students who had viewed positively evaluated words associated 
with slide photographs of Vietnamese. Parish and Lambert suggested that 
the reason for this outcome was that Negroes played a more central role in 
the lives of the subjects than did the Vietnamese. Because of attendant 
emotional significance, therefore, attitudes toward Negroes are more 
resistant to change. 


Method 
Subjects 


Subjects for this experiment came from an elementary school located in a 
middle class residential neighbourhood which is part of the greater 
metropolitan area of Vancouver, British Columbia. One hundred and four 
female and male students, with varied ethnic backgrounds, who were 
enrolled in grades five, six, and seven comprised the sample. 


Instrument 


The study utilized the same revised semantic-differential rating 
instrument used in the Parish and Lambert study. Only three changes were 
made. The word “test”? was replaced by the word “survey.” Likewise the 
words “Negroe Race” were replaced by the words “East Indians,” and the 
words “Vietnamese Race” by the word “Japanese.” The Hoyt estimate of 
reliability for the pretest scores was .85 and .91 respectively for the 
Japanese and East Indian scales. 
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Design and Procedure 


The experiment followed a simple pretest, posttest, control group design. 
The students were randomly assigned to three groups: treatment one, 
treatment two, and a control group. All students met in one large open area 
classroom where the directions for the instrument were explained to them. 
The pretest was then administered. Treatment Group One was then sent to 
the library where they received the first treatment. It consisted of twenty 
emotionally ambiguous slide photographs of East Indian Canadians. Each 
was projected onto a screen for two seconds and was followed immediately 
by a positively evaluated word, e.g., happy, and then a blank slide. Twenty 
emotionally ambiguous slide photographs of Japanese were then shown for 
two seconds, each followed by a neutrally evaluated word and a blank slide, 
each for five seconds. In all cases the students read the words aloud. This 
group was then sent to another room where they were given the same 
instrument as a posttest. The control group was then sent to the library 
where they discussed pictures of British Columbia’s geography. They were 
then dismissed to the other room where they also wrote the posttest. The 
second treatment group then came to the library. They were shown the 
same twenty slides of Japanese, but with positive words. They were also 
shown the same East Indian Canadian slides, but with neutral words. All 
words were recited by the students. After the treatment they were sent to a 
separate room to write the posttest. Figure 1 summarizes the design. 


Ethnic Group Words Ethnic Group Words 

P 2 

R d pe ) 

anal E East Indian Positive Japanese Neutral S 

2 Ww Japanese Positive East Indian Neutral aL 

ah 

TC A British Columbia E 

T Geography S 

. . Ld it 
Figure 1. Experimental design. 


Although subjects were randomly assigned to treatment and control 
groups there was some variation in pretest means. Covariance on the 
prettest scores was not possible because student responses were anonymous. 
The “t-test” statistic for repeated measures was therefore used to analyze 
the differences between pretest and posttest attitude scores. 


Findings 

Inspection of the pretest scores in Table 1 indicates that the subjects had 
more positive attitudes toward Japanese Canadians than they did toward 
East Indian Canadians. (Lower scores indicate more positive attitudes.) Ina 
separate analysis, the overall mean score for Japanese Canadians was 
calculated by adding the scores for the two treatment groups and the control 
group. The same procedure was employed using the mean attitude scores for 
East Indians. The respective grand means for the two groups are 24.46 and 
32.70. The “t” value for this difference is 6.40 which is statistically 
significant beyond the .05 level. Table 1 shows that subjects who observed 
slides of Japanese with positively evaluated words showed no change in 
their attitudes toward Japanese. Subjects who viewed slides of East Indians 
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TABLE 1 


T-TEST FOR PRE AND POSTTEST MEASURES OF ATTITUDES 
TOWARD EAST INDIAN AND JAPANESE CANADIANS 


Treat- Ny Pretest Posttest 
ha X gin x S.D. t D 
11 a: East indian Sree 8.8 28.83 8.0 DAWA >.05 
Japanese 24.06 pai PAN 515} Pad 1.46 Nie or 
Dee East Indian Bhs SS} 9.4 SH 10.6 5S) N.S. 
35 
Japanese 25.44 ed Dats) Te 4 .88 Neo 
oe * East Indian 31.69 7.6 7.05 8.4 .19 N.S. 
Japanese BESSY) 6.9 22592 6.1 OD Nie ore 


with positively evaluated words did have significantly more positive 
attitudes on the posttest. There was no significant change when neutral 
words were used with the same sets of slides. 


Discussion 


The purpose of this study was to test the hypothesis that positively 
evaluated words, when paired with emotionally ambiguous slide 
photographs of Japanese and East Indian Canadians, would cause change 
in attitudes. Students who viewed slides of Japanese paired with positively 
evaluated words showed no significant change in attitudes toward 
Japanese Canadians. Students who were shown slides of East Indian 
Canadians paired with positively evaluated words showed a significant 
positive change in attitudes toward East Indian Canadians. 


The findings in this study are the reverse of the findings of Parish and 
Lambert. Their study found that the attitudes toward the less threatening 
group were changed significantly in a positive direction; the present study 
revealed that there was no change in positive attitudes towards the less 
threatening group, but a significant change toward the more threatening 
group. A possible explanation for this may be that subjects already had 
very positive attitudes towards Japanese as shown on the pretest. A 
treatment effect was not measurable because the ceiling of the scale was 
reached on the pretest. 


If other replications show that positive word usage can have a 
significant effect, the implications for public education are important. More 
specifically in this study, the semantic differential showed that the subjects 
perceived East Indians to be dirty. It may be that more positive attitudes 
toward East Indians would result if East Indians were brought into 
association with the word clean. That could be done by emphasizing that 
cleanliness is an important value in the East Indian community. Their 
religion requires them to bathe twice a day. While it is understood that most 
classrooms would not experience intense sessions such as these, the 
pervasive use of positive language in classroom discourse could do much to 
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bring about significant positive change in attitudes towards a variety of 
groups. 
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Sentence-Combining and Syntactic Maturity 
in First Year University 


This study assessed the effect on the syntactic maturity of first year 
university students (1976, 1977) of six weeks of work with Strong’s (1973) 
sentence-combining materials. For determining words-per-T-unit and 
words-per-clause ratios, the Syntactic Maturity Test by O’Donnell and Hunt 
(see Hunt, 1970, pp. 11-12) and a free writing exercise were pre-post 
instruments, the first only being used in 1976 and both in 1977. With 
respect to the Syntactic Maturity Test, analyses of covariance showed 
consistently positive results from the 1976 and the 1977 treatments, with 
all F ratios significant at the .001 level. On the 1977 free writing, a similar 
analysis showed support for the words-per-T-unit measure only, with an 
F ratio significant at the .05 level. In every case, the experimental groups 
registered improvement to levels well beyond those reported by Hunt 
(1970) for skilled adults. The comparatively large increments experienced 
by these university students on these particular measures over sucha short 
period of time augured well for the development and testing of new 
curriculum materials designed to foster flexibility and maturity in young 
writers. (Dr. Stewart is a Professor of English Education, Curriculum 
and Instruction Division at the University of New Brunswick.) 


Two articles in English Journal, February 1976, signal the current 
interest in the subject of this study: “Close-Up: Sentence-Combining — Back 
to Basics and Beyond” by Strong (pp. 56; 60-64) and “The 
Sentence-Combining Myth” by Marzano (pp. 57-59). Strong’s piece is an 
enthusiastic popularization of informal sentence-combining for increased 
control of the syntactic resources of our language, while Marzano’s presents 
data that would seem to undermine the positions taken by Mellon (1969) and 
Hunt (1970) and issues a warning against expecting too much from the 
sentence-combining movement. 


As a matter of fact, during the last decade the whole subject of syntactic 
development in school children and adults has concerned a good number of 
English teachers and researchers. Hunt (1965, 1970) investigated the free 
writing of school children in grades 4, 8, and 12, and the writing of “skilled” 
adults who published in The Atlantic and Harper’s. To eliminate the very 
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real problem of distinguishing between sentences and nonsentences in 
children’s writing he introduced the “minimal terminable unit” or “T-unit” 
which he later defined as “one main clause plus any subordinate clause or 
non-clausal structure that is attached to or embedded on it” (1970, p. 4). For 
Hunt, this T-unit turned out to be the most significant index of syntactic 
maturity for school groups. Second best was clause length and third best, 
clauses per T-unit. Meanwhile, O’Donnell and his associates at Florida State 
University (1965-67) had developed a rewrite passage on an expository 
subject that elicited successively longer clauses and successively longer 
T-units from students in elementary and high school grades; it was this test 
paragraph that Hunt (1970) used in his study of syntactic maturity in 
schoolchildren and adults and that this researcher used in this study of the 
effect of sentence-combining practice on the writing of first-year university 
students. 


Five curricula designed to increase sentence-embedding were examined: 
Bateman and Zidonis’ (1966) in the ninth and tenth grades, Mellon’s (1969) 
in the seventh grade, Miller and Ney’s (1968) in the fourth grade, O’Hare’s 
(1973) in the seventh grade, and Combs’ (1975, 1976) also in the seventh 
grade. All reported that groups using their experimental materials showed 
significantly more growth than comparable control groups, and all observed 
no negative effects. The first two coupled definite instruction in the use of 
transformational-generative grammar with extensive sentence-combining 
practice, while the other three relied almost exclusively on a modified signal 
system to provide patterns for carefully structured practice and found that 
their ‘“grammar-free” experimental lessons resulted in syntactic maturity 
increments several times the normal. Also noted was Cooper’s (1973) guide 
for writing sentence-combining problems. Finally, in his search for new 
materials, the writer came upon Strong’s (1973) Sentence Combining — A 
Composing Book which provided exercises that, unlike others, did not utilize 
a system of transformational signals or guides and which encouraged the 
students to experiment, to evaluate, to rewrite, and to compose on their own. 
Both in content and in form, this text seemed ideally suited for the freshman 
students he was teaching. 

Two main factors prompted the present project: the current furor over the 
writing competencies of first-year university students, and the need to find a 
teaching strategy that might make for a relatively rapid increase in their 
syntactic capabilities. Some studies (Vitale, King, Shontz, & Huntley, 1971; 
Perron, 1974) had showed that short intensive exposure to sentence- 
combining practice accelerated growth in syntactic maturity; and, al- 
though Schmeling (1969) concluded that T-unit and clause length did not 
clearly differentiate between poor and good freshman papers, the ambitious 
review by Stotsky (1975) demonstrated quite convincingly that these could 
be useful measures with older groups and -that sentence-combining practice 
did increase both the maturity and the quality of writing in elementary and 
high school students. It seemed reasonable, therefore, to hope that this same 
effect could carry over into first-year university at least. O’Hare’s (1973) and 
Combs’ (1975) encouraging research gave rise to the design and procedure 
chosen, while Strong’s (1973) book provided the basic material for the 
control groups and Hunt’s (1970) study a test instrument. Finally, two 
six-week freshman “Communications” modules, running concurrently, 
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presented a good opportunity for an investigation beyond the high school 
level. 


Hypothesis, Design and Procedure 


The purpose of this study was to test whether the use of the 
sentence-combining practice material produced by Strong (1973) would 
increase the normal syntactic development of first-year university 
Education students over the short six-week period allowed in a 
“Communications” module. Subjects from established class sections were 
randomly selected and assigned to two pairs of control and experimental 
groups, and the O’Donnell-Hunt (1970) Syntactic Maturity Test (see Hunt, 
1970, pp. 11-12) and an analysis of free writing used to measure develop- 
ment in two factors of syntactic maturity. The amount of growth expe- 
rienced by the experimental groups was compared with that of the control 
groups and, where possible, with normative data reported by Hunt (1970). 
The work of O’Hare (1973) and of Combs (1975, 1976) provided a basis 
for both design and procedure, but no attempt was made to estimate the 
relationship of sentence-combining practice to overall quality of writing, 
which both of these researchers reported was significant, and no attempt 
was made to estimate the relative permanence of control group gains, 
although Combs (1975, 1976) showed significant retention after eight 
weeks as measured in words per T-unit. 


O’Donnell (1976) in his Critique of Some Indices of Syntactic Maturity 
concluded “It may turn out that in spite of lack of precision, T-unit length is 
still the most useful and useable index of syntactic development over a wide 
age-range and that mean clause length is the best single measure of 
syntactic complexity at the high school level and beyond.” Since this study 
involved first-year university students over such a short time-span, it 
seemed wise, then, to use here only the two indices most generally promising 
to date: words per T-unit and words per clause. 


Hypothesis 
The study was designed to test the following hypothesis for significance 
at the .05 level: 


The experimental groups (1976 and 1977), which were exposed to 
Strong’s (1973) sentence-combining practice material over six-week 
periods, will score significantly higher on two factors of syntactic 
maturity than the corresponding control groups which were not 
exposed to the same material. 


Design 


With the exception of the method for the selection of subjects, the 
pretest-posttest control group design described by Campbell and Stanley 
(1968, p. 13 ff.) was utilized. It took the following form: 


R O1X O2 
R O3 O4 
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Procedure, 1976 


During a six-week period in the spring of 1976, the writer taught one 
module of a four-module “Communications” course to two class sections at 
the University of New Brunswick (Fredericton, N.B., Canada) using 
Strong’s (1973) Sentence Combining: A Composing Book while his colleague 
S. E. Bateman taught another module of the same course to two other 
sections using Flanigan’s (1974) Patterns of Exposition: What, How & Why. 
Average enrollment per section was 25. Each student was involved in 18 
hours of in-class activity (three hours per week) and approximately 25 hours 
of out-of-class writing. At the beginning and the end of the six weeks, in two 
specially designated periods, all four class sections did the O-Donnell- 
Hunt Syntactic Maturity Test. There was no discussion of the project 
during the regular class and special testing periods, and the pre and 
posttests were included as part of the regular work of the modules. Finally, 
after the posttest periods, balanced groups were set up by selecting at 
random 32 students from the exposition class sections to form the control 
group and another 32 from the sentence-combining sections to form the 
experimental group. Then, for the 64 students thus selected, the pre and post 
“aluminum” paragraphs resulting from the O’Donnell-Hunt test were 
analyzed for words per T-unit (W/T) and words per clause (W/C) following 
procedures described by Hunt (1970, p. 4.) and O’Hare (1973, pp. 47-49). 


The work done by both groups followed very closely the textual materials 
and guides provided by the two authors. In both cases, during the six-week 
period, there was coverage of the topics in the texts, and all students 
completed a sampling throughout the texts of at least half of the exercises 
provided and activities suggested. In both cases, the objectives of the two 
modules were discussed, initially, with the students, with the general 
pedagogical rationale of the two authors adhered to as far as seemed 
reasonable. 


In the experimental group, the writer made use of the detailed guides in 
Strong’s (1973) Sentence Combining: A Composing Book and the 
accompanying IJnstructor’s Manual which included notes on the pro’s and 
con’s of sentence-combining and a number of practical teaching suggestions 
emphasizing individual, small-group, and whole-class exploration and 
evaluation of sentence-combining options arising out of the students’ own 
syntactic competencies. Content for each composition in Phase I was 
provided in the form of a sequence of kernel sentences grouped to facilitate 
choice of interesting syntactic alternatives. Topics and sequences 
progressed from simple to complex. Phase II was based on the work of 
Francis Christensen (1967), and each lesson presented a model sentence the 
structure of which was followed as students worked with the blocks of 
kernels provided. Techniques acquired in, both phases were periodically 
tested for usefulness in short compositions on individually-chosen topics. As 
far as theory and terminology are concerned, the lessons were practically 
“grammar-free.”’ 

Meanwhile, the control students concentrated on expository writing — 
selecting topics; gathering data; writing thesis statements; outlining, 
drafting, discussing, and revising short reports. Flanigan’s text (1974) was 
followed as closely as possible. There was some guidance in sentence 
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revision, but in the traditional manner only. Grammatical terminology was 
avoided consistently. 


Finally, it should be noted that the students in both groups were locked 
into the usual university time and course schedules and that there was no 
practical way to control their in-and-out-of-college language experiences, 
although all were enrolled in the first year of a four-year Education 
programme with required core courses designated “Introduction to 
Teaching” and ‘(Communications I.” It was also noted that, at the end of 
the academic year, the difference between the mean grade-point averages of 
the two groups on “Communications |” was only .05. 


Summarizing, independent variables included methods and materials: 
sentence-combining practice and related writing versus instruction and 
practice in the preparation and presentation of expository reports; 
dependent variables were two factors of syntactic maturity: words per 
T-unit and words per clause. Obvious limitations related to the selection of 
the groups involved and the language experiences of the subjects outside the 
sentence-combining and report-writing modules. 


Procedure, 1977 


The results of the 1976 project gave rise to the decision to repeat the work 
in 1977 under conditions as nearly as possible identical with those of a year 
earlier except that (1) both control and experimental groups were taught by 
the writer and were balanced at 25 each; and (2) in addition to the 
O’Donnell-Hunt Syntactic Maturity Test’s “aluminum” paragraphs, 300- 
word samples of free writing were elicited at pre and posttest time and 
also analyzed for words per T-unit and words per clause. Thirty-eight 
topics or “leads” were provided, and each student took one of these or one 
of his own choosing and expressed for other students his own opinions 
or concerns. 


Results 
Results, 1976 


These are summarized in Tables 1, 2, and 3 and confirm the hypothesis 
that the experimental group, which worked with Strong’s (1973) 
sentence-combining material, will score significantly higher on two factors 
of syntactic maturity than the control group which did not work with this 
material. 


Pretreatment and posttreatment W/T and W/C mean scores of control 
and experimental groups were compared by t tests for two independent 
means assuming unequal variances. Table 1 shows that, at pretreatment 
time, there were no significant differences between the two groups and 
supports the assumption that the groups were equivalent as the result of 
randomization. On the other hand, the comparable values at posttreatment 
show that the experimental group attained a highly significant gain, at the 
.001 level of confidence, over the control group on both measures. 

Pre and posttreatment mean change scores (posttreatment means minus 
pretreatment means) were then analyzed by t tests for correlated measures. 
Table 2 reveals no significant change for the control group over the six-week 
period involved while, for the experimental group, change was significant 
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TABLE 1 

COMPARISON OF PRE AND POSTTREATMENT MEAN SCORES ON TWO 
FACTORS OF SYNTACTIC MATURITY: CONTROL AND EXPERIMENTAL 
GROUPS, SYNTACTIC MATURITY TEST, 1976. 


Factor Mean SD Difference t value df 
Pretreatment 
W/T 
Control 12.48 2.89 
Experimental 12 Ate 370 sO4 -41 (NS) 62 
wW/C 
Control Sy Sho} 127.0 
Experimental 8.16 1.12 = 57 -48 (NS) 62 
Posttreatment 
W/T 
Control ILA AW Pde Als} 
Experimental OS 4.03 Sate! 7.18% 62 
W/C 
Control 7.94 47 
Experimental OR) Ze 2.81 5.44% 62 


* Significant at or beyond the .001 level. 


TABLE 2 
COMPARISON OF PRE AND POSTTREATMENT MEAN CHANGE SCORES ON 
TWO FACTORS OF SYNTACTIC MATURITY: CONTROL AND 
EXPERIMENTAL GROUPS, SYNTACTIC MATURITY TEST, 1976 


Factor Pretest Posttest Change SD t value df 
Control 
W/T 12.48 WAS OP -0.46 DID) Ih aly (QNES) She 
W/C 8255 1.94 =(0)5.3)) Io iSke! OZ a GN'S)) 31 


Experimental 


W/T WP AVE Nos) (14.78)° 5.01 4.36 6.51% Shi 
W/C 8.16 10575 G9.95) De NS) Ze) So aks 31 


@ Skilled adults, Hunt, 1970. 


* Significant at or beyond the .001 level. 


again at the .001 level on both factors. The experimental group’s mean 
pre-post change score of 5.01 words per T-unit over a six-week period at first 
year university compares most favourably with the gain of 6.12 words per 
T-unit reported by O’Hare (1973, p. 54) for his seventh graders over a much 
longer period of time. Also the present study’s experimental group wrote 
17.83 words per T-unit and 10.75 words per clause at the end of the six-week 
period — considerably more than the 14.78 and 9.95 reported by Hunt (1970, 
p. 20) for the skilled adults he tested with the same instrument used here. 


267 


M. F. Stewart 


TABLE 3 


ANALYSIS OF COVARIANCE OF POSTTEST WITH PRETEST SCORES: 
SYNTACTIC MATURITY TEST, 1976 


Degrees of 


Source of Variance Mean Square F Ratio 
Freedom 

W/T 
Between Groups Sie ye eal il 59.61% 
Within Groups BBV Actelo) 61 

W/C 
Between Groups 1335167 1 35.42% 
Within Groups 79.86 61 


* Significant at or beyond the .001 level. 


TABLE 4 
COMPARISON OF PRE AND POSTTREATMENT MEAN SCORES ON TWO 
FACTORS OF SYNTACTIC MATURITY: CONTROL AND EXPERIMENTAL 
GROUPS, SYNTACTIC MATURITY TEST AND FREE WRITING, 1977 


Factor Mean SD Difference t value df 
Pretreatment 
W/T 
Control 11.64 2.40 
(Gee w/.6)) (So2Z9) 
Experimental WS Sy) 32609 tees} eS has 48 
(14.60) (aly) CoslG) (GALS aN S)) (48) 
W/C 
Control 7.05 1.00 
( 8.19) (Gigs) 
Experimental 8.56 ILabyA Oil 2.48%** 48 
(Che 7K) G74) (Qus57) (1.21) (NS) (48) 
Posttreatment 
W/T 
Control TO '5) 2503 
@l6s25)) (3.42) 
Experimental Neer 3.80 (#5 3\0) 4. 85*** b 48 
(18.14) (4550) (1.89) (1.67) (NS) (48) 
W/C 
Control 780 O02 
( 9.69) Ces.) 
Experimental Ai6 PL Koho" 356 5.90**x* 48 
Gaile 2/9) e375) (1.60) (1.93)* (48) 


= Numbers in parentheses show free writing data. 
Significant to the .10 level only. 


* Significant at or beyond the .05 level. 
** Significant at or beyond the .0O1 level. 
kkk Significant at or beyond the .001 level. 
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Table 3 presents the results of a simple analysis of covariance used to 
ensure statistical control to the end that the results described above may be 
attributed, within limits of error, to the treatment variable and to no other 
causal circumstance. Here the significance of the F ratio again confirms the 
hypothesis. 


Results, 1977 


Because the 1976 results were so uniformly positive and because they 
were based on the use of a single evaluative instrument, it seemed 
worthwhile to repeat the experiment in 1977, this time having the 
experimenter teach both experimental and control groups and adding free 
writing to test the consistency of the original findings. 


The 1977 results are summarized in Tables 4, 5, and 6, and give qualified 
support to the hypothesis. 


This time, with new experimental and control groups, the ¢ tests 
compared the pre and posttreatment means for free writing as well as for 
the Syntactic Maturity Test, and the results were not as clear-cut as 
previously. The pretreatment means (Table 4) show significant W/T and 
W/C differences (at the .05 and .01 levels) on the Syntactic Maturity Test 
but not on the free writing. Randomization had apparently established 
equivalency on the free writing only. Also, although the posttreatment 
means show significant W/T and W/C differences (at the .001 level) on the 
Syntactic Maturity Test, the comparable differences on the free writing are 
not significant for W/T and at the .05 level only for W/C. The Table 4 data, 
then, give little support to the hypothesis. 

The pre and posttreatment mean change scores in Table 5 show no 
significant change for the control group except on the free writing W/C 
measure where the significance is at the .05 level while, for the experimental 
group, the change scores all give t values significant at the .001 level. These 
positive indications are in contrast with some of those in Table 4. Also, this 
1977 experimental group’s change means (W/T and W/C) on both the 
Syntactic Maturity Test and the free writing compare most favourably with 
those for the 1976 group, and all experimental posttest scores are again 
higher than those reported by Hunt (1970, p. 20) for his skilled adults. 

The analysis of covariance in Table 6 confirms the first hypothesis as far 
as the Syntactic Maturity Test results are concerned, but it leaves the free 
writing results in some doubt, with this measure’s W/T F ratio significant 
at the .05 level only and its W/C F ratio below that level. 


Summary 

With respect to the Syntactic Maturity Test, the 1976 groups were 
equivalent as a result of randomization, and the experimental group 
experienced growth and established superiority significant in both cases at 
the .001 level. An analysis of covariance further confirmed the hypothesis. 

As far as the Syntactic Maturity Test was concerned, the 1977 groups 
were not equivalent as a result of randomization, but they were with respect 
to the free writing samples. On both the Syntactic Maturity Test and the 
free writing, the experimental group experienced growth significant at the 
001 level. However, it was only on the Syntactic Maturity Test that 
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TABLE 5 


COMPARISON OF PRE AND POSTTREATMENT MEAN CHANGE SCORES ON 
TWO FACTORS OF SYNTACTIC MATURITY: CONTROL AND 
EXPERIMENTAL GROUPS, SYNTACTIC MATURITY TEST AND FREE 
WRITING®, 1977 


Factor Pretest Posttest Change t value df 
Control 

W/T 11.64 iLike 9S pail 5 5ie) (GNIS) 24 

(14.76 @lGis325)) (1.48) 1.96 (NS) 24 

W/C oS) 7.80 oll) w/o NS) 24 

(Si Siee9)) (C B59) G4 9) (32 is 24 


Experimental 
W/T ISS Sy Wc, 485) 2407 4.16%** 24 


(14.60) (less) Gaaease (3.58) (4.44) ** (24) 
w/C 8.56 11.36 2 2.80 hy G1ee 24 
ae iG) (iie29) 6 (2.53) (3.49) (24) 


* Numbers in parentheses show free writing data. 
Skilled adults, Hunt, 1970. 


* Significant at or beyond the .005 level. 
**k Significant at or beyond the .001 level. 


TABLE 6 


ANALYSIS OF COVARIANCE OF POSTTEST WITH PRETEST SCORES: 
SYNTACTIC MATURITY TEST AND FREE WRITING, 1977 


Degrees of 


Source of Variance Mean Square F Ratio 
Freedom 
Syntactic Maturity Test 
W/T 
Between Groups IB yA 5 Sy 1 17.97%** 
Within Groups 8.47 47 
W/C 
Between Groups IAW eS il 27.83%%* 
Within Groups Asif 47 
Free Writing 
W/T 
Between Groups 49.40 a 3.74* 
Within Groups 13 1¢) 47 
W/C 
Between Groups 19.38 1 2.50 (NS) 
Within Groups gh 47 


* Significant at the .05 level. 
** Significant at or beyond the .001 level. 
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superiority was established at the same level. Again, as far as the Syntactic 
Maturity Test was concerned, an analysis of covariance supported the 
major hypothesis. On the free writing, however, the support was for the 
W/T measure only, with an F ratio significant at the .05 level. 


Both in 1976 and again in 1977 the experimental group registered 
improvement on both the W/T and W/C measures well beyond the levels of 
the skilled adults in Hunt’s 1970 study. The 1977 free writing mean of 18.14 
words-per-T-unit for these college freshmen compares most favourably with 
the 14.78 level attained by Hunt’s (1970) mature group. 


The covariance tables emphasize the consistency of the positive results 
of the 1976 and 1977 treatments as measured by the Syntactic Maturity 
Test, with all F ratios significant at the .001 level. They also indicate the 
somewhat weaker case for the effect on the improvement in free writing. 
However, the degree of consistency from year to year is not inconsiderable 
when one keeps in mind the usual variability of freshman groups, the short 
time-span allowed, and other obvious limitations. 


Conclusion and Comment 


The results allow the general conclusion that the writing of the 
experimental groups, 1976 and 1977, was syntactically more mature than 
that of the control groups and syntactically more mature than that at 
pretest time. When compared with Hunt’s (1970) norms, the level of 
syntactic maturity of the experimental writing exceeds that attained by 
mature adults as measured by W/T and W/C. Also, freshman W/T gains for 
the six-week period were, in each case, at least half as great as gains 
reported by O’Hare (1973, p. 58) for his grade seven students over an 
extended eight-month period and greater than those reported by Combs 
(1976, p. 142) for students at the same level over a four-month period. The 
fact that the lowest W/C gain here was 2.53 as compared with 1.49 (O’Hare) 
and .70 (Combs) supports the belief that clause length can be a significant 
measure of syntactic maturity at the upper levels. In all of this, however, it 
must be remembered that the 1977 groups exhibited some gain which can 
not be attributed to the treatment (See Table 4). Nevertheless, the overall 
picture is a positive one. 

The 1976 results, which were based on the Syntactic Maturity Test alone, 
uniformly support the hypothesis. For 1977, however, the two groups 
presented obvious problems: at pretreatment time, randomization succeeded 
in equating them on the free writing but not on the Syntactic Maturity Test, 
and yet the analysis of covariance showed more significant differences in 
the Syntactic Maturity results than in those for the free writing. The 
consistency of Syntactic Maturity Test gains in both years is interesting, 
and it seems reasonable in view of the fact that the competencies required in 
coping with Strong’s sentence-combining materials and the test itself are 
very similar. What contrasts there were between the two sets of data may be 
due, in part at least, to teacher-treatment influence: in 1976 the 
experimental group was taught by the researcher while the control group 
was taught by a colleague; in 1977, both groups were taught by the 
researcher. It appears, then, that variation in the 1977 procedure succeeded 
in producing variation in the 1977 results. 
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No attempt was made here to relate the syntactic maturity ratios to 
teachers’ judgements as to “quality” of writing; but the positive findings of 
O’Hare (1973) and of Combs (1976) on this relationship indicate that these 
indices may serve as guides in this most important area at the junior high 
school level. Consequently, it seems reasonable to assume that the W/T and 
W/C increments presented in this study may indicate growth in “quality” 
at the freshman level as well. 


Combs (1976) was first to demonstrate through delayed posttesting that 
W/T and W/C increments continue, at reduced levels, to distinguish 
significantly between control and experimental groups in studies such as 
this. In the present work, although nothing of the sort was attempted, the 
increments actually attained during a six-week period seem all the more 
surprising because the students involved were initially already writing at 
about the level of Hunt’s “average” adults (1970, p. 20); it is quite likely that 
these effects would not erode completely and that they might even be 
maintained or augmented if appropriate practice were continued. 


Implications 

The basic purpose of this work was to test the value of Strong’s 
sentence-combining materials in increasing the syntactic maturity of 
first-year university students over six-week periods in two successive years 
and, in general, its affirmative conclusions correspond with those of Mellon 
(1969), O’Hare (1973), and Combs (1976). But whereas Mellon coupled 
sentence-combining practice with direct transformational-generative 
grammar instruction, O’Hare and Combs omitted the study of grammar as 
such and relied solely on patterned practice. In contrast with these, Strong’s 
Sentence Combining: A Composing Book (1973) gave the experimental 
classes in this study a great deal of opportunity for freer and more extensive 
experimentation: the shaping of short compositions from groups of kernel 
sentences provided, concentrated practice in the use of the ‘“‘free modifiers” 
and the “cumulative sentences” so persuasively advocated by Francis 
Christensen (1967, 1976), and a good deal of exploration and evaluation of 
rhetorical options. Thus this study represented a departure from previous 
investigations not only in the short time-span allowed and the nature of the 
population included but, more importantly perhaps, in the type of 
sentence-combining practice and free writing they did. And, in spite of the 
numerous variables that bear on the results of this and other 
sentence-combining studies (See Combs, 1976, p. 147) and the unevenness of 
the 1977 results, the net effect here gives added support to the general value 
of sentence-combining, extends its range into first-year university, and 
emphasizes the usefulness of diversified practice closely related to the 
composing process. 

This study indicated that Strong’s materials were appropriate for the 
freshman level as they did help to make these particular classes places for 
mutual inquiry and active experimentation involving the varied syntactic 
options open to them when challenged with a series of kernel sentences on a 
topic of concern. In this case, transfer of attitudes and skills to other writing 
seemed quite natural, even inevitable. 


In Chapter III of Wardhaugh and Brown’s (1976) A Survey of Applied 
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Linguistics, Jean Malmstrom (pp. 50-53) traces the impact of linguistic 
grammar on the production of curricular materials with particular reference 
to sentence-combining, 1965-75, and she cites this as a promising example of 
a case in which “the rule of lag between research and technology did not 
hold” (p. 52). Unfortunately, however, the gap between the new materials 
and actual classroom practices remained, for, as Hook (1976) comments, 
“This fine material is about ready for the children but are the teachers 
ready for the fine material?’’(p. 211). Hook was discussing a new elementary 
school language and composition series that contained sentence-combining 
materials, but his remark applies equally to teachers of older students. 


In the context of the present furor over the maturity and quality of 
college writing, the freshman composition teacher should now be 
encouraged to study the claims and limitations of sentence-combining 
research and the contribution that it might make to his efforts. An 
examination of the exercises of O’Hare (1973, 1975), Cooper (1973), Strong 
(1973), and Perron (1974) will provide him background for constructing and 
trying out materials adapted to the particular needs of the students he is 
teaching. Most promising seems to be the movement away from formally 
patterned practice and towards the integration of sentence-combining 
activities with the whole composing process. Above all, the teacher can help 
his students discover for themselves the basic language competencies they 
already possess, and, by every means. possible — _ including 
sentence-combining — help them gain confident control over the syntactic 
resources of their language. 


O’Hare (1973, pp. 76, 77) made twelve suggestions for further research, 
and this writer supports these in toto. In particular, he feels that the effects, 
both positive and negative, of different kinds of sentence-combining 
practice on various types of high school and junior college writing should be 
investigated and that careful and balanced assessments should be made of 
this type of activity in different situations and with different student 
populations. 
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Call for Papers 


In its endeavour to encourage discussion among educational researchers 
of diverse traditions, the AJER invites the submission of manuscripts for 
our new categories, “Perspectives” and “Essay Reviews.” The perspective 
paper, the first of our new categories, presents through the essay format 
what might be termed a speculative or philosophic point of view. The paper 
assesses current theoretical or practical problems of educational research 
and considers their implications for further study. Such a paper, which at 
present does not easily fit existing procedures of research reporting, might 
center on the analysis of new directions and prospects in educational 
research. An examination of aims, procedures, and methods of research, 
including a discussion of their limitations and possibilities, might form the 
core of the argument; a case study could be employed to substantiate the 
position. It differs from the standard AJER manuscript submission not only 
in emphasis but, more importantly, in addressing and perhaps challenging 
fundamental questions of research. The essay by E. Miklos on 
“International Developments in Educational Research” is the first of our 
“Perspectives.” 

The category of “Essay Reviews” is distinguished from the established 
book review format in breadth and in depth. The essay review provides a 
comprehensive analysis of two or more recently published studies of 
educational research. It should focus on an examination of the central 
problems presented by the authors, should select one or two for close study 
and comparison, and should consider their implications for the development 
of educational inquiry. 

Authors interested in contributing to either “Perspectives” or “Essay 
Reviews” are invited to submit in advance an abstract of their proposal. 
Although the categories will not appear on a regular basis, they will 
supplement the Journal’s existing approach in reporting the results of 
educational research. Additional information will gladly be forwarded upon 
request. 

In addition to encouraging submissions to “Perspectives” and “Essay 
Reviews”, the AJER continues to welcome manuscripts reporting the results 
of both empirical and nonempirical research. To this end, contributors are 
referred to the back inside cover of the present issue for a guide to the 
preparation of manuscripts. 


Henry W. Hodysh 
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PERSPECTIVES 


International Developments in 
Educational Research 


E. Miklos 
The University of Alberta 


Canadian educational research is confronted by numerous problems. If 
research is to contribute to educational development in important ways, 
then these problems must be addressed by educational researchers and by 
those who are responsible for research policies at various levels of 
government. The solutions to the problems must be appropriate to the 
Canadian scene; they must take into account the unique factors which 
impinge on educational research in Canada. Solutions should not be 
imported from elsewhere; however, this does not mean that those involved 
in finding solutions to problems can afford to remain uninformed about 
developments in other educational settings. On the contrary, deliberate 
attempts should be made to determine what solutions to similar problems 
have been proposed or adopted in a variety of contexts. 


The paragraphs which follow were stimulated by a conclusion: 
Canadian educational researchers have given inadequate attention to 
developments in educational research outside of North America. The 
conclusion is neither contaminated nor distorted by empirical data; it is an 
impression resulting from various forms of nonsystematic observation. This 
paper attempts to provide the basis for suggesting that educational 
researchers in Canada and the United States might find it worthwhile to 
give greater attention in future to developments on the international scene; 
others, Holdaway (1976) for example, have made similar suggestions on the 
basis of more convincing evidence. Major sections of the paper focus on the 
problems which attract the attention of researchers, a perspective on some 
developments in other countries, and the examination of three specific areas 
in which Canadian educational research might benefit from international 
exchange and dialogue. 


Problems of Researchers 


Whenever and wherever educational researchers gather to discuss their 
work, two major themes are likely to dominate. The first relates to the 
substance of research, to new developments in various areas, to recent 
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advances in research methodology and design, and to promising directions 
for future research. These matters are the subject of formal meetings and of 
the discussions which follow. The other theme relates more closely to the 
circumstances surrounding educational research and to the problems of 
researchers. Although these topics serve as the basis for formal discussions, 
they are more likely to be the subject of informal conversations. 


The problems of educational research are numerous; the list is long and 
easy to construct. It includes items such as the following: educational 
research does not enjoy support, particularly financial support, comparable 
to that enjoyed by research in other fields; the educational researcher is not 
admired as much as is the researcher in the physical sciences or medicine; 
educational research is fragmented, uncoordinated and unorganized; 
researchers are inadequately trained for the various demands of their role; 
the fruits of their labours are disregarded by practitioners and policy 
makers; and the autonomy of the researcher is precarious. These are 
exaggerations; nevertheless, issues and problems of educational research 
funding, of the relationship of research to practice, of the status of the 
researcher, and of the control and coordination of research are important 
issues. Researchers experience such problems in their everyday lives and 
may be distracted from their work by them. The problems probably warrant 
systematic attention. 


These problems are part of the Canadian research situation; however, 
only a few researchers have information about the extent to which they 
obtain elsewhere. Which of these, if any, are perceived as problems in other 
educational systems? How do researchers cope with the constraints imposed 
by the environment? What alternative models have been adopted for 
funding and coordinating research activities? A search for answers to these 
questions leads one into an examination of the literature on educational 
research in other countries. 


International Developments 


The literature on educational research at an international level has two 
main characteristics. The most striking feature is that there is not much of 
it; comparative international perspectives on educational research other 
than those provided by the OECD (Organization for Economic Cooperation 
and Development), the Council of Europe, and Unesco appear to be very 
limited. The second characteristic is the significant influence which a few 
countries appear to be having on the definition of educational research and 
on issues relating to the role of educational research in relation to 
educational development. In English-speaking countries, and to some extent 
in Western European countries, that influence is being exercised largely by 
the United States. : 


American Influence 

Although educational research activities do have distinctive 
characteristics in different countries, the prescriptive literature on 
educational research clearly shows the evidence of the effects of practice 
and thought in the United States. In terms of such criteria as the extent of 
funding of educational research, the productivity of educational researchers, 
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and the influence of research on practice, the United States is frequently 
used as an example of the standards which other countries should strive to 
achieve. 


Some indication of the importance of the United States literature is 
provided in a 1975 study by Malmquist. Although this particular study 
focused on the state of educational research in Europe, about 45 percent of 
the nearly one thousand items in the review of the literature originated in 
the United States. The maximum for any European country was 10 percent, 
about the same as that for international organizations. Even the latter may 
have been influenced indirectly by American sources. The items in the 
Malmquist literature review concerned only the nature and form of 
educational research and not its content. If reviews of any substantive area 
had been included, the dominance of researchers in the United States would 
no doubt have been overwhelming. 


The impact of the research activity in the United States has been felt 
more heavily in Canada than in other countries. Although not influenced 
greatly by developments in structures and methods of financing, Canadians 
have been influenced by emphases in specific substantive areas and by 
ideas about the directions which future research should take. On all but a 
few topics, a review of literature is not complete without a review of the 
United States literature. 


One of the examiners for the recent OECD review of educational policies 
in Canada, Professor Noah of Teachers College, Columbia University, 
observed that Canadian educational research was heavily influenced by the 
United States presence and example in terms of problems tackled, research 
strategies used, and means of disseminating results. He wondered if 
Canadians should take a national view of research and dissemination 
problems not merely for the sake of being different but because there are 
unique issues and opportunities for research that may be being neglected 
(OECD, 1976, p. 186). Most Canadian researchers would agree with this 
perceptive observation with respect to both what is and what should be. 
Educational research in Canada is different from that in the United States 
to some extent, but not much different. 


Educational Research in Europe 


There are numerous hazards in attempting to generalize about the state 
of educational research in other countries. One reason for this has to do 
with the limited current information which is readily available; the other 
relates to the varying definitions of educational research and to the various 
types of structures for carrying out these activities. A recent OECD (1974) 
report classifies countries according to the stage of development of 
educational research; the stages are labelled simply Stage I and Stage II. 


Educational research and development in Stage I is characterized by 
what might be termed “free” research. Problems are identified by the 
researcher, and research is conducted for the purpose of producing 
knowledge through scholarly enquiry in order to develop a discipline. Any 
effect which the research has on practice or on educational policies is 
indirect; that is, research contributes to building a store of knowledge which 
is available to those who may wish to relate it to practice. There are no 
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direct links between researchers and policy makers. The typical research 
center consists of a university chair and the associated research institute. 
The centers are small with a high proportion of part-time staff. A 
significant proportion of the research is conducted by doctoral students who 
have limited access to resources other than their time and energy. 


Research which is conducted tends to be single-discipline research. The 
studies focus on nonempirical, theoretical and apolitical problems; there is 
heavy reliance on documentary sources and relatively straightforward 
data-gathering. Emphasis is placed on philosophical, historical, and 
comparative studies; socio-psychological problems are likely to be in- 
vestigated by those who have a more empirical orientation. In this stage of 
research, the usual development strategy involves pilot schools in which 
innovations are tested before they are implemented throughout the system. 
In some countries certain schools are designated permanently as 
experimental schools. Of course, at times there may be implementation of 
innovations without any try-out. 


By contrast, educational research and development in Stage II has quite 
different characteristics. The purpose of research is to produce knowledge 
which can be used to make decisions; that is, research is client- or 
user-oriented. There is more direct communication between reseachers and 
policy makers, and research activities may be related to development plans. 
As a result, research problems are determined by clients either through 
consultation with researchers, commissioned research, or other means. 


In Stage II, the research and development organizations vary from 
university-based institutes and autonomous centers to ministry-related 
research and development units. These organizations tend to be larger and 
more numerous than in Stage I, and there are direct or indirect means for 
coordinating research efforts. Researchers have access to more resources, 
are more highly trained and have higher prestige. There is greater 
probability of multidisciplinary research which includes empirical studies of 
learning processes, of teaching methods and materials, and of school 
organization. Innovations are tried out on a comprehensive basis in selected 
regular schools and gradually extended to all schools in the system. 


According to the OECD analysts, a number of European countries — in 
particular France, Italy and Spain — are still in Stage I. Those which seem 
to be well into Stage II include Sweden, Norway, Finland, the Netherlands, 
and England. A number of countries — such as Federal Republic of 
Germany — are in varying degrees of transition from the first to the second 
stage. There are some indications that the USSR and some other socialist 
countries could be added to the list of those in transition. Although much 
research activity would still be similar to that of the first stage in these 
countries, information provided by Birzea (1973) suggests that there is a 
shift to a more policy-oriented, comprehensive approach to educational 
research and development. 

Although accurate comparative descriptions are not available, most of 
the developing countries of Asia, Africa, and Latin America are probably in 
the early phases of Stage I; however, it is likely that the transition to Stage 
II could be made without awaiting the further development of Stage I. Both 
fundamental research and the applied or decision-oriented research could be 
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developed simultaneously. An illustration of this type of development may 
be India which has many of the characteristices of Stage I but where there 
is also an awareness of the need to develop complementary research 
activities (see, for example, Buch, 1974). 


In general, a more useful conceptualization of Stage II might be to 
consider that it includes everything of Stage I with the addition of new 
purposes and institutional arrangements. The development of educational 
research could then be considered in terms of the elements added rather 
than the extent to which previous activities have been supplanted. 


Coordination of Research 


The European countries may also be categorized on the basis of whether 
they take a centralized or a decentralized approach to the coordination of 
educational research. Included among the countries which attempt to 
coordinate educational research directly through centralized mechanisms 
are Sweden, Finland, Denmark, France, Portugal, Hungary, and the USSR; 
those which have adopted decentralized or indirect forms of coordination 
include Norway, England, Germany, and Yugoslavia (OECD, 1974). Clearly 
there is no necessary relationship between the stage of research 
development and whether a centralized or decentralized approach is 
adopted for planning and coordinating educational research activities. 
There will, of course, also be variations within these two broad categories of 
either a centralized or decentralized approach. 


Basically, where centralized planning and coordination exist, the 
research problems or programs are defined by an agency closely associated 
with the Ministry of Education, and these themes are communicated to 
research centers and individual researchers. Research projects are approved 
centrally, and funding is largely from public sources. Two countries which 
have adopted quite different forms of the centralized approach are Hungary 
and Sweden. 


According to a recent report (Agoston, 1977), Hungary has a long-term 
research plan which encompasses educational research. Coordination is 
achieved through a central committee at the ministerial level. The 
committee selects research units or individuals for specific research projects, 
recommends funding to the Minister, receives progress reports, and 
evaluates interim and final results. These centrally coordinated projects 
may not include the whole of research activity. As Agoston (1977) stated: 

Any research institution, any department of education may do research outside 
this central plan, and the financial resources required for this purpose by the 
university or college institutes or departments must also be secured. In principle, 
these research establishments are at liberty to decide whether or not they want 
to accept a research assignment under the plan. (p. 14) 


He then added: “Of course, if they decline, they will receive no funds from 
the budget of the Interministerial Coordinating Committee” (p. 14). This 
particular research director reported that he had obtained substantial 
funding for participating in three national-level research projects. 


Sweden has adopted a policy of central planning and coordination 
coupled with the concept of mobile funds. Research funds are allocated to 
projects rather than to institutions or to the appointment of full-time 
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researchers (Marklund, 1977). The funding of research in Sweden is 
relatively high, and the achievements of research are regarded favourably 
by outside observers. 


In Sweden, educational research and development policy is defined by a 
special bureau for educational research and development within the 
National Board of Education and the Office of Chancellor of Universities. 
As Marklund (1977) has stated: 

The major question in planning and coordination of educational research and 
development activities at the national level is how to allocate available 
earmarked resources. (p. 24) 


An illustration of the result of this planning in Sweden is provided by the 
distribution of funds for different types of research in 1976-77: 


9% — basic research 
70% — policy-oriented research and development at the preschool, 
school, adult education and teacher training levels 
21% — policy-oriented research in higher education 


The 70 percent allocated to policy-oriented research at other than the higher 
education level was broken down further as follows: preschool, 5%; 
compulsory, 12%; post-compulsory, 12%; adult, 138%; general educational 
problems, 15%; and, planning and information, 7%. Research priorities are 
determined cooperatively by researchers and policy makers. 


The centralized, coordinated approach will find particular favour with 
those who prefer highly rationalized structures and procedures. Other 
countries would serve as models for those who prefer more decentralized 
and less direct forms of coordination. Two which will be described briefly 
are the United Kingdom (England, Wales, Scotland, and Northern Ireland) 
and Norway. . 


In the United Kingdom, coordination of educational research and 
development is decentralized to some extent. Then within that 
decentralization, the preference for informal understandings as opposed to 
formal agreements and mechanisms is as evident in educational research as 
it is in other endeavours. An OECD (1974) report described the bodies which 
promote research in different sectors of England and Wales as follows: 

Fundamental research carried out in universities is financed essentially by the 
University Grants Committee and, to a lesser extent, by the Social Science 
Research Council; the Schools Council commissions most of the projects dealing 
with programmes and evaluation; the National Council for Educational 
Technology is responsible for research carried out in this particular field. A 
substantial part of the activities of the NFER (National Foundation for 
Educational Research) is devoted to research and development. (p. 46) 


The report states that “there is a wealth of interconnections between them 
but no formal coordinating mechanisms” (p. 46). These interconnections are 
achieved, in part, by having governing boards which include 
representatives of the Department of Education and Science, the research 
and development community, teachers and local education authorities. 

The situation in Norway is similar and appears to be the result of a 
deliberate policy to diversify and pluralize support of educational research 
as was suggested by the OECD (1974) report. The purpose of the various 
sources is to have different value structures giving direction and support to 
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different types of educational research. It is assumed that the universities 
will tend to support fundamental research and that the Ministry of 
Education will favour applied research and development, but far from doing 
so exclusively. Sources of funding, therefore, include regular university 
appropriations, the Research Council for Science and Humanities, the State 
Council for Experiments in Education, and special funds within the budget 
of the Ministry. The report states: 

. no formal boundaries are drawn up between the field of operation of the 
various financial bodies, and practical coordination is ensured by considerable 
overlap of individuals participating. No attempt is made to ensure consistency, 
as different value structures are permitted to operate simultaneously. A 
researcher turned down by one agency because of its particular criteria of 

| selection may be accepted by another operating on different criteria. (p. 46) 

The policy which gives this freedom to research is extended even to the 
research agency. For example, the research staff of the Institute for 
Educational Research at the University of Oslo have half-time for research 
work. The projects are not run by the Institute but by the staff members 
who personally apply for grants from research funds or do research on 
commission (Carelli & Sachsenmeier, 1977, p. 134). 


Clearly, the centrally directed, formally coordinated and the 
decentralized, informally coordinated approaches represent two alternative 
models for organizing educational research. Each appears to be satisfactory 
to at least some of the parties involved, and the prerequisites for 
effectiveness, if any, could be determined. Models such as these could be 
applied at either the national or the state and provincial levels. 


Substantive Research Areas 


The substantive areas of research in different countries are determined 
by various factors such as academic traditions; political or social pressures 
which either prevent or encourage particular kinds of studies; awareness of 
social, economic and political factors affecting education; as well as 
availability of funding for research in particular areas (OECD, 1974, p. 48). 
Some of these emphases have already been alluded to in previous mention 
of the stages of research. 


Although a broad range of problems is pursued within any country, 
there do appear to be some general trends such as the following: 

1. projects in psychology and sociology relevant to education represent a 
large part of European research; 

2. history of education, philosophy of education and comparative education 
are still important areas of research in France, Germany and Italy; and, 

3. in the Netherlands, the United Kingdom and the Scandinavian countries 
research frequently focuses on school curricula, evaluation and learning 
processes in schools. 


In some instances, research emphases are related to proposed or current 
reforms in education, particularly in the Northern European countries more 
so than in the Southern ones. 

Research in the Eastern European countries seems to span the full range 
from traditional academic and _ socio-psychological research to 
developmental projects closely associated with educational changes. Among 
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the research projects mentioned in recent literature (Carelli & 
Sachsenmeier, 1977) are compensatory preschool education in Hungary, 
lifelong education in Poland, and curricula for polytechnical education in 
the German Democratic Republic. 


The director of one research institute in the USSR (Kashin, 1977) 
described a recent research thrust as follows: 
Theoretical and practical elucidation of problems concerning the content of 
education in schools has been entrusted to our Research Institute for the 
Content and Methods of Education, which forms part of the USSR Academy of 
Educational Sciences. At present 263 research workers and staff of our Institute 
(not counting ancillary staff) are engaged on curricular improvement of all 
subjects in accordance with scientific principles. Scientific laboratories headed 
by the greatest methods specialists in the country have been set up for each 
subject. In addition, the Institute has several laboratories to study particular 
problems, such as the level of the pupils’ knowledge, arts education, education in 
the natural and mathematical sciences, optional courses, etc. (p. 54) 
He indicated that future direction of research would focus on methods: 
... the main efforts of our Institute over the next five years will be directed 
towards the theoretical elucidation of the problem of teaching methods and their 
application in accordance with the requirements of life. We realize that this is 
far from being a simple problem. (p. 62) 


The director did not elaborate on the exact definition of teaching methods 
nor on the direction which the research is likely to take. The research thrust 
does reflect the attention being given to teaching methods by the Academy 
of Pedagogical Sciences which addressed the improvement and 
modernization of teaching methods at its general meeting in December of 
1976 (Innovation, 1977, p. 2). 


Trends 


The sources to which reference has been made in this paper suggest that 
educational research in Europe is in varying stages of transition from the 
traditional academic research emphasis to the more applied, developmental 
types of research activities. This transition does not always proceed 
smoothly and calmly. Malmquist (1975, p. 59) has pointed out that there is 
some concern on the part of researchers in the Federal Republic of Germany 
about the shift from the free research model to that in which research serves 
policy makers. In spite of these and other concerns, there do appear to be 
moves in Europe toward: 

— increased emphasis on applied educational research; 

— closer links between researchers and practitioners both at the operational and 
the policy making levels; 

— involvement of a broader range of talent in educational research including 
teachers and graduate students; 

— acceptance of some form of coordination of educational research; 

— focus on problems which are closely linked to educational development plans. 


These changes prompt a number of questions about the future of 
educational research. First, what impact will cooperation between 
researcher and client have on the nature of educational research? In 
particular, what will be the effect on basic or fundamental research? 
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Second, will certain forms of coordination prove to be superior to others? 
Third, will researchers be able to produce the types of results that will 
continue to be satisfying to both the client and researcher? These types of 
problems and possibilities suggest that the developments in the European 
countries and elsewhere should be monitored over time. 


Some Persistent Problems 


A review of trends in educational research in other countries reveals the 
diversity of approaches that can be taken to solving specific problems or to 
working towards particular objectives. Clearly, there are no simple solutions 
to problems of funding, to improving the status of educational research, to 
changing interactions between researchers and clients, or to coordinating 
educational research activities. The review does suggest some alternative 
models and strategies that might be considered in relation to various 
problems. 


If educational research in Canada is to make significant contributions to 
educational improvement, the search for alternative solutions to persistent 
problems must be continued. Of the various problem areas, three to which 
renewed attention could be given are: (1) the relationship between 
researcher and practising educator; (2) the degree of specialization in 
educational research; and (3) the preparation and continuing education of 
researchers. These are three aspects in which the present state of 
educational research in Canada is not entirely consistent with the 
characteristics of Stage II educational research. As Holdaway (1976, p. 15) 
suggested, approaches to solving similar problems elsewhere may suggest 
alternatives that merit serious consideration. 


Relating Research to Practice 


The record of the impact of educational research on practice is not 
particularly impressive, either in North America or elsewhere. Dramatic 
changes in education have seldom (never?) resulted solely from research 
activities. Educational research has made its contribution, but that 
contribution has been made gradually and subtly. Whether or not there 
could be more productive associations between researchers and clients than 
now exist in Canada has not really been tested. Although in recent years 
some organizational arrangements have been developed to facilitate 
communication between researcher and practitioner, both parties have been 
cautious about the relationship and have tried to ensure that proper 
distance be maintained. 


Many educational researchers prefer to work in the relative comfort of 
the university or similar organization which provides security and 
protection. Researchers would like to have access to generous levels of 
funding for projects which they are free to choose. Researchers might like to 
influence practices in schools or educational policies but are not anxious to 
have their research efforts directed by some outside agency. Research 
results are marketed through publications and conferences; often 
researchers are disappointed that their products find no buyers or may be 
startled, when they do receive attention, by the implications which 
consumers draw from research results. For the most part, Canadian 
educational researchers operate within a relatively closed system and 
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seldom engage in dialogue either with those who fund research or those who 
use the results of research. 


For their part, many practitioners and policy makers appear content to 
carry out their activities without being well informed about the 
contributions that research might be able to make. Policy makers are 
reluctant to reveal policy changes which they are considering or to enlist 
the aid of researchers in assessing alternatives and in conducting some 
advance studies. Too many changes are decided on the basis of political 
appeal, frequently without a concerted effort to determine their educational 
merits or practicability. Curricular changes, policies on school buildings, 
examination practices, and the content of teacher education may all be 
affected by hastily conceived policy decisions. Sometimes the policy maker 
will enlist the researcher through commissioned research or requests for 
proposals which are announced on short notice and which may have 
unrealistic time or budget limits. The researcher is placed in a “take it or 
leave it” position. Such ventures may serve only to reduce the probability of 
expanded future collaboration. 


Initiatives to remedy the situation could, and probably should, come 
from both sides. The community of educational researchers could undertake 
to inform those who make educational policy or who advise on educational 
policy what linkages might be developed as part of an overall research 
policy. The policy advisors could take the initiative to inform themselves 
about the contributions which educational research can make and the 
alternative structures through which those contributions could be made. 


In addition to these general discussions, more researchers could concern 
themselves with the issues and problems that have direct implications for 
the fields of policy and practice. The initiatives of.individual researchers 
might do much to create more favourable working relationships between the 
producers and the consumers of research. An illustration of what might be 
done is provided by Yates (1977) in the context of some research projects 
carried out in England: 


What I am advocating is that we should explore the possibilities of carrying out 
research projects not on behalf of administrators and teachers but rather in 
cooperation with them. Instead of the consumers being required first of all to put 
forward questions for research and then pay heed to the results that it yields but 
leaving the conduct of the research in the meantime to the professional 
researchers, what is envisaged is a partnership that persists throughout the 
exercise. (p. 44) 


According to the report of the Director, the experience of the National 
Foundation for Educational Research in England and Wales of involving 
teaching staff in conducting a number of research projects apparently has 
been quite successful. This action research approach has demonstrated that 
the technical skills required for designing specific research projects can be 
provided by a small core of researchers while the actual conduct of the 
project is in the hands of the teachers themselves. A collaborative approach 
reduces problems of dissemination of research results. 


The search for functional and effective linking mechanisms between 
educational researchers and the users of research merits continued 
attention. Rather than merely trying to communicate more effectively with 
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practitioners, researchers should also be prepared to consider different types 
of relationships. 


Specialization in Research 


Two of the characteristics of Stage II educational research and 
development are (1) improved training of researchers, and (2) a greater 
probability of multidisciplinary research. The question arises as to whether 
the two are compatible. Increased training of researchers probably leads to 
higher degrees of specialization in narrowly-defined areas. Discovering 
increasing numbers of problems to research within those areas is likely to 
lead to decreased communication and interaction with researchers in other 
areas of specialization. Decreased communication will decrease the 
probability of multidisciplinary research. If this is indeed the case, then one 
of the characteristics of Stage II research and development may be difficult 
to achieve without special effort. 


The effects, both positive and negative, of high degrees of specialization 
in educational research are relatively easy to observe in Canada and the 
United States. On the one hand, there have been notable achievements 
within the various areas of specialization; this has contributed to the 
international recognition of researchers in North America. We see also the 
negative consequence: specialized journals which are read only by selected 
researchers; interest groups which attract only people with like interests; 
and production of research results which find increasingly less direct 
application in the educator’s world. 


In order to counteract the dysfunctional consequence of specialization, 
special efforts should be made to ensure that there is research of an inter- or 
multidisciplinary nature. In North America, this means first of all crossing 
some of the disciplinary boundaries within educational research in relation 
to major research and development projects. Thus, to take an obvious 
example, an interdisciplinary curriculum project should include not just 
curriculum experts but also educational psychologists, educational 
sociologists, and educational administrators. Such collaboration has the 
potential of resulting in a better definition of the problem and in outcomes 
which are more likely to be applicable in schools. From time to time 
researchers need to remind themselves that problems encountered by 
practitioners do not lend themselves to neat divisions according to the 
boundaries which have been established within the field of educational 
research. 


Even if the dysfunctional effects of disciplinary boundaries can be 
reduced, the problem of linking educational research and development to 
other areas of study still exists. Not only have sub-specializations within 
educational research become insulated from each other, but educational 
research has also become isolated from other institutional areas. The 
possible merits of interdisciplinary studies is brought to attention by an 
example in a different social and cultural setting. As Wrocynski (1977) 
stated: 

Being a socialist country, Poland attaches considerable importance to 
interdisciplinary studies in education because the development of education and 
schooling is closely linked with the pace of development in all fields of our 
socialist culture and economy. We are currently implementing an ambitious 
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program of educational reform aimed at making secondary school universal. 

These changes determine the direction of interdisciplinary studies . . . (p. 30) 
Two other areas in which interdisciplinary studies have been carried out 
include forecasting the development of the educational system and lifelong 
education. 


An illustration of the general approach taken to interdisciplinary studies 
in Poland is outlined by Wrocynski (1977) in relation to a symposium on 
lifelong education: 

The above-mentioned problems requiring the cooperation of educationists with 
representatives of other sciences in particular with psychologists, physiologists 
and anthropologists, are fundamental for the design of a system of life long 
education. An educational reform cannot limit itself to organizational matters. 


Wrocynski described the groups represented at the symposium: 

. representatives of different sciences, directors of educational institutes and 
heads of enterprises, who discussed the implications of lifelong education for 
schools. The problems were discussed from the points of view of: aims of 
education, content of education, teaching methods, the biological and 
psychological development of a child, development of mental creativity, 
vocational training and the role of the place of work. . . Our experience allows us 
to draw conclusions regarding the methodology of further complex studies 
involving cooperation with practitioners and their role in the verification of 
theoretical opinions. (p. 35) 


Of course, the symposium was not that unique, and there is no information 
on the extent to which subsequent research was of an interdisciplinary 
nature. Nevertheless, the intent is clear and may have had a different 
outcome from one in which a group of educational researchers sets out with 
the intention of solving a problem or developing a pon without input from 
researchers in other fields. 


The point of this discussion is not whether specialization is necessary or 
desirable but rather about our apparent inability to take steps to offset some 
of the dysfunctional aspects of high degrees of specialization. Educational 
research may be too fragmented and too insular. Consideration should be 
given to increasing the interaction among educational researchers as well 
as with researchers in other fields. 


Improving the Researcher 


Analyses of the state of educational research and development usually 
lead to prescriptions for improving the quantity, the quality, or practical 
relevance of that activity. Although these prescriptions may touch on all 
facets of research, they have one striking feature in that they all seem to 
emphasize organizational, managerial or technical changes which would 
improve research. Only passing mention is made of the researcher, of his 
values and outlook, and of his ability to be self-directing rather than 
directed by the organizational framework which surrounds him with 
constraints. 

Proposals for the improvement of research and development which focus 
on the need for national policies, for increased funding, for new structures, 
and for different linkages do, of course, identify highly important factors; 
however, these factors form only the environment of research activities. At 
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best they can have only an indirect effect on the quality and quantity of 
research produced; these will be determined much more directly by the 
persons who actually conduct the research. Regrettably, little attention is 
given as to how researchers might be helped to improve regardless of the 
organizational structures within which they may find themselves. 


Too little attention seems to be given, in proposals for improving 
research, to the initial training and further education of researchers; 
furthermore, training may be inappropriately defined. The Malmquist study 
(1975, p. 241) of educational research in Europe found that the expressed 
need for further training was concentrated in areas related to the processing 
of data, such as statistics, research methods, mathematics, and computer 
science; that is, further training needs were identified as technical training 
needs. No mention is made by Malmquist of preparing researchers to make 
decisions about selection of research problems, establishing research 
priorities, or assessing the practical significance of research results. 


There is evidence that some researchers, at least, need assistance with 
selection of research problems. For example, Buch (1974) reported that in 
India, research and development of testing instruments attracted more 
attention than did the problem of causes of failure in school. As he stated: 

A large number of studies have been undertaken in the area of educational 
evaluation and examinations. A good deal of research has been devoted to 
constructing standardized achievement tests. A few research projects have 
studied the pyschometric properties of examinations. But the periodic analyses 
of examinations conducted by the State Boards of Education reveal the ghastly 
fact of more than fifty percent of the students failing in public examinations. 
Even this has not evoked studies which may throw light on this state of affairs. 
(p. 23) 
The apparent misdirection of research efforts is not unique to India or other 
developing countries. We have examples much closer to home of where 
educational researchers have for too long neglected educational problems 
which were not defined as problems for research. 


One solution to the problem of directing the researchers’ attention to the 
“right” topics is to impose controls; that is, to set priorities externally or to 
provide incentives through funding. An alternative might be to rely more on 
self-direction on the part of the researcher by including problem 
identification — not just problem definition and delimitation — into 
preparation and continuing education programs. In most fields of study 
there appears to be no continuing process which would evaluate the 
importance of the problems being studied. Reviews of research tend to focus 
more on compilation of summaries, on comparisons, and on technical 
quality of the research rather than on directions which research might take. 


Another aspect of preparation that merits attention is suggested by 
some data reported by Malmquist (1975, pp. 121-122). When members of a 
committee were asked to judge research proposals for funded projects, a 
definite geographical bias was observed: 84 percent of proposals emanating 
from the home cities of committee members were judged acceptable, while 
only 33 percent of proposals submitted by researchers in other cities were 
judged acceptable. This leads Malmquist to wonder about the capability of 
scientists to apply “the same standards of objectivity and rationality in the 
judgement of their own affairs as in their research work” (p. 121). 
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Perhaps this will suffice to support the suggestion that preparation of 
researchers must be broader than training in the acquisition of technical 
research skills. Initial and continuing education should include topics such 
as the goals of educational research and development, the philosophy of 
science, the role of the scientist in society and the impact of personal values. 
Of course, to propose that such education would compensate for the 
inadequacies of organizational arrangements and insufficient funding 
would be foolish, but then so is assuming that structural changes alone will 
remedy deficiencies in educational research. 


Conclusion 


The brief glimpses into the state of educational research in other 
countries which were presented in various parts of this paper provide some 
support for the suggestion that educational researchers should become 
much more familiar with developments elsewhere than they tend to be. 
Researchers are well aware of the need to monitor at the international level 
the progress of research in substantive areas; however, this interest needs to 
be extended to the conduct of educational research in general. Of value 
would be more information about the insights into the varying definitions 
of educational research, the role of educational researchers, the linkages 
between educational research and policy, and related matters. 


Perspectives on these developments are difficult to obtain from the 
literature. Reports by those who have visited other countries are an 
improvement; reports by those who actually work in those other 
environments would be even better. Educational research associations in 
Canada and the United States could facilitate the exchange of information 
by regularly inviting researchers from other countries to discuss with us the 
state of educational research in their educational system. For example, it 
would be informative and helpful to know how educational researchers cope 
with the constraints imposed by a central plan, how they respond to the 
demands under a policy of mobile funds, and with the way in which the 
activities of research institutes are linked to educational reforms. It would 
be all too easy to adopt the view that others are “trying to catch up” and 
that nothing is to be learned from them; this would be an erroneous and 
unproductive perspective. There appears to be a high degree of similarity in 
the problems which confront educational research in different countries; the 
solutions differ. Researchers and those who make policy about educational 
research might be well-advised to learn more about these solutions. 


This paper is based on notes prepared for a presentation to the Annual Meeting of the American Educational 
Research Association in Toronto on March 30, 1978. The helpful comments of E.A. Holdaway on an earlier draft 
are gratefully acknowledged. 
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BOOK REVIEWS 


THINKING WITH THE TEACHABLE MACHINE. By John S. Andreae. 
London, England: Academic Press Inc., 1977, 178 pp., $15.25. (ISBN 
0-12-050050-1) 


In the prologue of this book the author states, “Fifteen years of research 
with the one aim of designing a system which can learn anything ‘from 
scratch’ has taken me from simple ideas to simpler ones.” The result of 
these efforts is PURR-PUSS, about which Andreae _ rhapsodizes, 
“PURR-PUSS .. . has been found capable of choice, of original actions, of 
counting, of substitution and recursion, of setting her own goals and of 
having them set for her. PURR-PUSS can be taught in any language, can 
be given any size of memory and can interact with the real world. She has 
no known fundamental limitation. She becomes her experience.” 


This passage will astound Artificial Intelligence (AI) researchers. Has 
Andreae actually managed to realize that simplistic dream of the fifties, the 
automaton which evolves toward sapience through “self-organization” 
alone? Were researchers too hasty in accepting the view that a computer 
cannot be programmed to acquire a cognitive skill until the computational 
mechanisms required for that skill are clearly understood? As it turns out, 
there is no need to recant that very sensible view. Orthodox AI researchers 
may safely return to their study of specific mechanisms for solving 
problems, proving theorems, understanding language, manipulating blocks, 
and so forth. Andreae offers neither experimental nor theoretical evidence 
that PURR-PUSS will ever perform intellectually nontrivial tasks. With 
PURR-PUSS, as of yore with the Homeostat and Perceptron, optimistic 
extrapolations require a prodigious inductive leap: if PURR-PUSS can learn 
simple tasks from scratch then she(!) can learn anything from scratch. 

What is PURR-PUSS? It is essentially a sequence-extrapolation program 
which bases its predictions and actions on recurrent subsequences of 
symbols. For example, PURR-PUSS would predict the pair of symbols P4 
A4 after seeing the sequence 

P1 Al P2 A2P3A3P4A4P1A1P2A2P3A3... 
because the triple of pairs P1 Al P2 A2 P3 A3 was previously followed by P4 
A4. There are actually five such predictors working concurrently. The first, 
called main-PUSS, operates as just illustrated. Of the others, pattern-PUSS 
is responsive to patterns only (i.e., the Pi), action-PUSS to actions only (1.e., 
the Ai), physical-PUSS to action-pattern pairs in which the action is 


BL 


syntactically marked as “physical,” and speech-PUSS to actions 
syntactically marked as “speech.”’ Each watches for recurrent n-tuples (n = 
3, 4, or 5, depending on the PUSS) of the events to which it is responsive so 
as to predict the next such event. All patterns are supplied by “Teacher.” 
Actions are supplied by PURR-PUSS or by Teacher at PURR-PUSS’s 
discretion. 

PURR-PUSS’s basic decision strategy is to generate action sequences 
which previously led to an unexpected pattern. In that sense, it is 
‘“novelty-seeking.” (However, it should be kept in mind that a “pattern” is 
an unanalyzed character string and that “expectations” are based on 
recurrent n-tuples). For example, when the sequence 


P1 Al P2A2P3A3P4A4P1A1P2A2P3A8P4... 


has been seen, pattern-PUSS anticipates Pl as the pattern following the 
next action. Since P1 was not previously predicted as a successor to patterns 
P1 P2 P3 P4, PURR-PUSS regards it as “novel” and so seeks out this 
“novelty” by choosing action A4, which previously led to Pl. Similarly 
longer paths toward “novel” patterns can be followed, by chaining forward 
pattern-PUSS predictions. There are some further features and refinements. 
For example, actions are priority-ranked depending on whether or not a 
path to a novel event is being followed, and which of the secondary PUSSes 
concur with the prediction of main-PUSS. Equal-priority choices are made 
randomly. Also Teacher can signal disapproval of an action, to inhibit 
future selection of that action as a successor to the preceding n-tuple. 


This sums up chapters 1-4, which also include many implementation 
details, illustrative examples and anecdotes. Evidently Andreae wants the 
book to be comprehensible and appealing to non-computer scientists, among 
whom he perhaps expects to find a more receptive readership. For example, 
he devotes several pages to explaining well-known methods of hash-coding 
n-tuples of symbols. (Incomprehensibly, he equates triply redundant 
hash-coding with holographic storage.) In another section, Andreae 
recounts how he found a periodicity in an electronic coin-tossing machine 
by compiling conditional event tables with the aid of PUSS. Though the tale 
is entertaining, it turns out that Andreae, not PUSS, discovered that every 
third toss duplicates its predecessor. Indeed, though Andreae makes no 
mention of the fact, PUSS is inherently incapable of discovering this 
regularity. 

This illustrates a fundamental weakness in Andreae’s prediction scheme: 
its inability to generalize over sequences and their relationships, i.e., its 
inability to form concepts. Its learning is of a trivial sort, incapable of 
coping even with simple statistical trends, let alone regularities requiring 
nonfinite-state computations (e.g., 10110111011110111110...). PURR-PUSS 
superficially resembles a ‘production system” a la Newell & Simon (Human 
Problem Solving, Prentice-Hall, 1972), but completely lacks the usual 
flexibility of production format and function. 


In chapters 5-7, PURR-PUSS is taught such tasks as “‘counting” (setting 
a register bit to 1 for each bead encountered on an imaginary 3-bead, 5-place 
abacus), saying HELLO MISTER so-and-so HOW DO YOU DO in response 
to any one of 3 “face” descriptions (e.g., the description of MISTER 
TRIANGLE FACE: < FACING> [ BOTMLEFT < WEDGE>[ UPACROSS 
<HAT>[ DNACROSS <WEDGE >), and following fixed routes in a 
discrete maze of 16 adjoining squares. Each teaching session consists of 
several hundred pattern-action pairs. Some of the _ pattern-action 
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subsequences are “speech.” To the extent that “speech” consists of recurrent 
n-tuples of words, PURR-PUSS can recognize and duplicate previously 
encountered phrases, and use them to trigger other expectations and 
actions. After detailing his teaching of the maze task, Andreae remarks “... 
we have seen that PURR-PUSS can set her own goals in a simple way.” By 
this he means that in the middle of an action sequence prompted by a 
verbal request, PURR-PUSS sometimes veers off and does something else. 
The reason is, of course, that the action sequence may contain an n-tuple 
which was part of another action sequence taught earlier. This phenomenon 
might less kindly be described as confusing two goals. Andreae also 
mentions an earlier hardware device CAESAR but remarks enigmatically 
that “the behaviour of CAESAR when driven by PURR-PUSS was difficult 
to describe with the printed word” (Perhaps unprintable expletives are 
required?). About a more recent device ESAW, which uses a suction cup to 
move about chips on an array of photocells, Andreae says only that “ESAW 
is proving a rich environment for PURR-PUSS” and refers to an internal 
report for further information. 


In the Appendix, Andreae proposes a brain model called Mew-Brain 
which is based on a formal-neuron realization of the basic PUSS 
sequence-prediction mechanism. 

Of what value is this book and to whom? Its value to AI is negligible. In 
fact, its apparent relevance to that field is largely an illusion fostered by 
Andreae’s use of AI buzz-words such as “multiple context,” “chunking of 
information,” ‘hypothesis formation,” ‘resolving ambiguity” — and for 
that matter, “pattern,” “language,” and “intelligence” — in trivialized 
senses. In one of several preemptive strikes against his critics, Andreae 
declares that intelligence, like beauty, is in the eye of the beholder. Perhaps 
so, but this beholder sees no glimmer of it in PURR-PUSS. Non-computer 
scientists may find the book a pleasant diversion. They are urged, however, 
to balance their reading with Bertram Raphael’s excellent AI primer, The 
Thinking Computer (W. H. Freeman, 1976). Educators, in particular, should 
be skeptical of Andreae’s suggestion that PURR-PUSS offers insights into 
learning and teaching. The Mew-Brain proposal may be worth serious 
consideration by brain theorists. Despite its limitations, it indicates one 
way of stepping out of the electro-chemical frame of reference. The stylized 
drawings of cats by Andreae’s daughter Gillian are another redeeming 
feature. 


L. K. Schubert 
Department of Computing Science 
The University of Alberta 


VALUES AND THE FUTURE: MODELS OF COMMUNITY COLLEGE 
DEVELOPMENT. By John O. Hunter..Sherman Oaks, California: Banner 
Books International, 1977, 166 pp., $8.95. ISBN 0-89491-005-1) 


The main thesis of this book is that a greater demand for educational 
accountability by the public creates a bureaucratic value system 
(characterized by more centralization of authority) which is antithetical to a 
more autonomous value structure appropriate for the effective operation of 
community colleges. Since accountability is a reality which must be faced, 
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community colleges must identify sets of values and developmental 
strategies which will accommodate bureaucratic pressures while preserving 
as much as possible the autonomy necessary to carry out their educational 
mandates. Having established this thesis, the book proceeds to develop the 
philosophical premises upon which to base such an approach to future 
college development. Unfortunately the assumptions and arguments used to 
establish these basic underpinnings are in most cases logically inconsistent 
or based on false premises. This is particularly disappointing because the 
operating models, conclusions, and recommendations derived from such 
poorly thought-out beginnings are in many cases insightful and extremely 
useful for practising community college administrators. Put more briefly, 
the book reminds me of a well-designed building sitting on a poorly 
engineered foundation. 


As one example of this structural deficiency, Doctor Hunter readily 
admits to a preference for an Organizational Development approach over 
the Management By Objectives method of administration. However in order 
to justify his position, he finds it necessary to establish an artificial 
dichotomy between the two approaches, arguing that MBO is not 
compatible with a “natural systems” approach to management while OD 
fits very nicely with such a model. He attempts to defend such an assertion 
by stating that MBO is not suitable for organizations in the public sector 
(citing one article which agrees with this view but admitting that other 
students of organization including Drucker disagree with this contention) 
while strongly supporting the use of “natural system” (and by implication, 
OD) in educational institutions. MBO is narrowly described as paying 
insufficient attention to the values underlying an organization, as 
inhibiting the thinking-through of possible consequences arising from 
decisions, and as measuring success purely on the basis of the achievement 
of organizational goals. Most descriptions of MBO tend to differ greatly 
from the above. The implementation of MBO usually begins with the 
formulation of long range goals which are based upon fundamental 
organizational values. Job descriptions and specific objectives are only 
generated after the broad mission and environmental aspects of the 
organization are examined. Periodic reviews of consequences including 
nongoal-oriented effects are part of the overall process (see for example, 
Raia: 1974, pp. 22-32). The point of this discussion is that using an OD 
approach in educational institutions can be easily justified without the 
strained set of arguments used by the author. 


Hunter advocates a greater emphasis on nonquantitative approaches to 
instructional evaluation and argues against the use of technology as a 
substitute for the fundamental role of the teacher. Again he finds it 
necessary to base his views on a set of tenuous assumptions and poorly 
developed arguments. In this case the artificial dichotomy is between 
teaching as an art or a science. As he puts it, 

The assumption that technology would change the fundamental role of the 
teacher, however, is false. Technology makes teaching a science, whereas 
teaching has always been primarily an art. (p. 53) 

Arguing that teaching was primarily an art (a doubtful proposition) and 
must therefore always remain an art is not in accord with either logic or 
observable behaviour. One can enumerate many phenomena which were 
considered one thing in the past and have now become something else. 
Flying was only for birds. Swimming underwater was done mainly by fish. 
Mathematics, economics, and psychology were part of philosophy, now they 
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are “pure” or “social” sciences. Why then can teaching not become a 
science? 

In order to support such an assertion, Hunter uses Dewey’s aesthetic 
philosophy to show the connection between teaching and art. Both, he 
argues, are experiences, both communicate through unique languages and 
serve to create a dynamic, ordered, aesthetic and moral environment (p. 51). 
While such terms might be used to describe what occurs when someone is in 
contact with a truly talented and inspired teacher, it is doubtful whether 
most of us have had many formal learning experiences which could be 
described in such a way. Therefore, according to Hunter, we have not been 
taught. 


If the fundamental role of teaching can be defined as helping students to 
attain desired learning objectives (a definition which seems to be generally 
acceptable and which is posited only because the author provides no 
explication of this term), there is little argument with Hunteyr’s conclusion; 
again only his premises are questionable, but how this fundamental role is 
carried out can vary a great deal. The teacher can act as a communicator of 
information, as a manager of educational experiences, or in countless other 
ways. By selecting appropriate learning strategies which take into 
consideration students’ interests and objectives and instructors’ strengths 
and weaknesses, teaching becomes a mixture of art and science. 


In contrast with the previous topics, Hunter’s discussion of how a 
general education component might be integrated into vocational 
programming is very insightful. He suggests, for example, that instructors 
of general education courses meet regularly with faculty from vocational 
program areas in order to better understand the needs and interests of the 
students and the objectives of the program. Particularly persuasive is his 
argument that a liberal arts approach to general education in community 
colleges is not appropriate; that courses must be developed in conjunction 
with the vocational faculty. He believes that the future of general education 
in community colleges lies with staff development and that only by 
stimulating general scholarship rather than _ allowing more 
discipline-oriented approaches can a viable general education component be 
developed and maintained. Chapters on the impact of collective bargaining, 
evaluation, and planning also provide well thought-out policies and 
guidelines as well as practical suggestions relating to the implementation of 
such policies. 

The last chapter of the book provides additional insights into Hunter’s 
position on general education which, as it turns out, means providing 
students with a set of values. Since such a view implies that there are 
universal values (a comparison with Marxian views on this subject would 
certainly have provided a more balanced outlook on this topic), a journey 
into Kantian philosophy is required. Actually, this rationale leaves the 
reader with the impression that the author has again indulged himself in a 
philosophical tour de force which has little relationship to the posited theme 
of the book. The net effect of this propensity to philosophize is to reduce the 
effectiveness of Hunter’s many excellent observations and suggestions 
regarding the administration of community colleges. 


P. Shragge 


Grant MacEwen Community College 
Edmonton, Alberta 
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STAFF DEVELOPMENT IN HIGHER EDUCATION. Edited by Lewis 
Elton and Kerren Simmonds. Surrey, England: Society for Research into 
Higher Education, Ltd., 1977, 166 pp. 


For the Canadian academic or administrator interested in higher 
education the opportunity to obtain a perspective on the state of the art in 
Britain prompts a number of speculative questions, among them: What is 
the prevailing view of staff development? How are needs determined and by 
whom? How are programs developed and delivered? What incentives are 
exercised? What administrative facilitation is provided? How successful are 
programs in meeting objectives? How widespread is the practice? What are 
the unique features? . . . and so on. Obviously any single book which 
addresses these and other relevant questions in any detail would be a 
comprehensive, monumental piece of work. In the present case no such 
claims are made. The book is a collection of papers (or abstracts of papers) 
presented at the twelfth annual conference of the Society for Research into 
Higher Education at the University of Surrey. In its 166 pages, thirty-seven 
different presentations are summarized, the diversity of which is indicated 
by the seven thematic headings used by the editors for purposes of 
classification. 


The book apparently serves three purposes: a record of proceedings, a 
source book of persons to contact for information, and, for the more 
ambitious, an opportunity to develop an image of a system in operation. For 
the Canadian reader the purpose mentioned last, while perhaps the most 
significant, is not facilitated by the use of insular terms such as DES, 
sandwich courses, the one-off course, polytechnics and the like. However, 
effort has its rewards. 


In a short review it is not possible to comment on each included paper, 
but a few examples might help to reveal the interest potential of the 
contents. J. Stoddart of Hull College of Higher Education presented an 
introductory paper titled “Staff Development and Course Development in 
the Public Sector of Higher Education.” He noted that colleges could be 
categorized as having been, or not having been, subject to radical change of 
purpose or structure in the past ten years. In the former case (e.g., 
polytechnics), staff development is closely linked to institutional change 
and curriculum development and, consequently, is more readily accepted 
than in more stable colleges where “there is rarely any serious attempt to 
appraise staff performance... or what will be demanded of individual staff 
members in the future” (p. 5). Stoddard holds the view that administrative 
responsibility for staff development in times of change goes beyond 
facilitating the individual’s personal aspirations, even though this may 
create morale problems. “If the colleges are to fulfil their obligations and 
not be forced into creating wholesale redundancy, they need: 1. a realistic 
appraisal of present staff competencies, 2. a realistic plan for institutional 
development, 3. the identification of shortage areas and over-staffed areas, 
and 4. proposals for staff retraining” (p. 8). 


Billing attempts to reconcile the interests of the individual and of the 
institution in a working definition of staff development which appears to 
have been adopted at the conference: 

Staff development is a deliberate and continuous process involving the 
identification and discussion of present and anticipated needs of individual staff 
for furthering their job satisfaction and career prospects and of the institution 
for supporting its academic work and plans, and the implementation of 
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programmes of staff activities designed for the harmonious satisfaction of these 
needs. (p. 22) 

Reference is made to societal forces which impel change in institutions 
including constrained resources, amalgamation of colleges, the changing 
graduate job market, and centralization of planning decisions (all of which 
have a familiar ring in Canada). Billing noted the specific influence of the 
Council for National Academic Awards which confers degrees earned 
through programs of study at polytechnics. In one of its reports it requires 
staff to engage in research and consultancy as a means of maintaining 
quality. Other landmark reports between 1964 and 1975 from various 
national committees have been taken seriously by the University 
community. For example one document makes it “incumbent upon 
universities to provide training for the probationer” (p. 21). (Is it hard to 
imagine a parallel situation in Canada?) 


Warren of Leicester Polytechnic discussed the institutionalisation of 
staff development and confirmed, by an example, the extent to which this 
has progressed. 

Formulation of acceptable policy in this area depends on wide ranging 
consultation thus the Academic Board established a committee in 1974 
including the Assistant Director, the Staffing Officer, representatives of Heads 
of School, academic staff and non-academic staff, and representatives of 
relevant agencies in the Polytechnic. A policy statement on staff development 
for academic staff was adopted by the Academic Board on the advice of the Staff 
Development Committee in July 1975. (p. 28) 


The policy referred to above found expression in induction courses for 
new staff, mandatory initial teacher education courses, the Development 
Discussion procedure, and training courses for Heads of School in the 
techniques used in the development interview. The most frequently 
identified development activity was research and scholarship in one’s 
specific discipline. Other activities identified were secondments and 
exchanges, taking advanced degrees, in-service workshops and seminars. 


Little attention is given to the development of conceptual models of staff 
development or to research. Fox presented a matrix for conceptualizing staff 
development needs in terms of the experience of the staff member and the 
possible roles of a professional practitioner. Hounsell mentioned a five-year 
program of research recently initiated at the University of Lancaster. While 
the focus of the research is upon student learning, findings from the project 
are expected to have implications for educational development of both 
students and teachers. Several reports of surveys of staff and student 
perceptions of teaching, and developmental projects aimed at improving 
teaching are given. 

The general impression conveyed by the book is that staff development 
in Britain is a societal expectation which institutions have acknowledged 
and responded to by the creation of internal structures for its study and 
promotion (e.g., Institute for Research and Development in 
Post-Compulsory Education, University of Lancaster; Staff Development 
Service, Trent Polytechnic; University Teaching Methods Unit, University 
of London; Institute for Educational Technology, University of Surrey; 
Department of Educational Services, Sheffield City Polytechnic); and by 
institutionalization of staff development procedures and requirements. The 
prevailing administrative view is to relate staff development to institutional 
needs and to instructional improvement, whereas staff tend to equate it 
with discipline-based research and other scholarly activity. The 
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predominant delivery mechanisms are internal programs and formal course 
work leading to advanced qualifications. Some opportunities exist also for 
study leaves, exchanges, and secondments. 

Interest in staff development in higher education is worldwide. This was 
evidenced, in a small way, by the inclusion of presenters from Canada, 
United States, Holland, West Germany, and Australia. Apparently, 
however, in Britain there is more action behind the rhetoric than in other 
parts. 


James M. Small 

Centre for the Study of 
Postsecondary Education 
The University of Alberta 
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